CLONING OF RHADINOVIRUS GENOME AND METHODS FOR ITS USE

PRIORITY CLAIM 

This application is a continuation of U.S. Provisional Patent Application No. 60/205,652,
filed May 18, 2000. This application is also related to PCT application US99/26260, which claims
priority from U.S. Provisional Patent Application No. 60/109,409 and U.S. Provisional Patent
Application No. 60/107,507. All of these applications are incorporated herein by
reference.


FIELD OF THE INVENTION 

This application relates to a Japanese macaque herpesvirus (JMHV), specifically to the
nucleic acid sequence of open reading frames in the virus and to amino acid sequences encoded by
these sequences. Compositions and methods are provided for the production of animal models useful
in assessing the efficacy of drugs for the treatment or prevention of conditions associated with
infection by the virus, such as multiple sclerosis. In addition, methods are provided for the isolation
of related viruses.

BACKGROUND 

Converging lines of evidence indicate that Kaposi's sarcoma-associated herpesvirus
(KSHV) is the etiological agent responsible for Kaposi's sarcoma (KS) in individuals with and
without HIV infection (Chang et al., 1994, Science 266:1865-9; Schalling et al., 1995, Nature Med.
1:707-8; Moore & Chang, 1995, N. Engl. J. Med. 332:1181-5; Whitby et al., 1995, Lancet 346:799-802;
Ambroziak et al., 1995, Science 268:582-3; Dupin et al., 1995, Lancet 345:761-2; Chuck et al.,
1996, J. Infect. Dis. 173:248-51; O'Neill et al., 1996, J. Clin. Pathol. 49:306-8; Gao et al., 1996,
Nature Med. 2:925-8; Kedes et al., 1996, Nature Med. 2:918-24; Gao et al., 1996, N. Engl. J. Med.
335:233-41). In addition to KS, KSHV is also responsible for other acquired immunodeficiency
syndrome (AIDS)-related and non-AIDS-related malignancies, such as primary effusion lymphomas
(Cesarman et al., 1995, N. Engl. J. Med. 332:1186-91; Nador et al., 1996, Blood 88:645-56; Otsuki et
al., 1996, Leukemia 10:1358-62), and multicentric Castleman's disease (MCD), a B cell proliferation
disorder associated with overexpression of IL-6 activity (Soulier et al., 1995, Blood 86:1276-80;
Yoshizaki et al., 1989, Blood 74:1360-7).

More recently, KSHV has been proposed to be involved in multiple myeloma, a B cell
abnormality of monoclonal origin (Rettig et al., 1997, Science 276:1851-4; Said et al., 1997, Blood
90:4278-82; Parravicini et al., 1997, Science 278:1969-70; Masood et al., 1997, Science 278:1970-1;
Whitby et al., 1997, Science 278:1971-2; Cottoni et al., 1997, Science 278:1972; Brousset et al.,
1997, Science 278:290-4). Understanding how KSHV is involved in these malignancies is important
for the generation of therapies against the spectrum of KSHV-associated diseases.



WO 01/88203 PCT/US01/16274 




Testing the efficacy of therapeutics and vaccines against any disease, such as KSHV infection, is
greatly facilitated by the availability of an animal model, such as a non-human primate model,
because non-human primates are physiologically very similar to humans. Although such models
have been developed for the study of HIV infection (for example, U.S. Patent Nos. 5,212,084 and
5,543,131), none has yet been developed for KSHV infection.

Infection of animals with some herpesviruses, namely Herpesvirus saimiri and murine
herpesvirus type 68, can cause a lymphoproliferative disorder (LPD). However, these infections are not
adequate models of KSHV pathogenesis because the viruses lack certain KSHV genes that may contribute
to viral pathogenesis or influence HIV infection, such as Interleukin 6 (IL-6) and macrophage
inflammatory protein 1 (MIP1) (Albrecht et al., 1992, J. Virol. 66:5047-58; Virgin et al., 1997, J.
Virol. 71:5894-904). Thus, the establishment of a non-human primate model for KSHV
infection has so far remained elusive.

The present invention addresses this problem, and others, in the development of animal 
models for a variety of pathological conditions and diseases. 


SUMMARY OF THE DISCLOSURE 
Rhesus macaques naturally harbor a virus related to KSHV, referred to as RRV, for rhesus 
rhadinovirus. The present disclosure also includes information about pathological conditions 
associated with RRV infection. 

Japanese macaques can harbor a virus related to RRV, called Japanese Macaque Herpesvirus
(JMHV).

An isolated virus is provided (Japanese Macaque Herpesvirus, JMHV) as deposited with
ATCC as deposit accession number PTA-1884, deposited May 18, 2000, as are viral particles
including this virus and host cells infected with this virus.

A purified polypeptide is also provided that includes an amino acid sequence that has at least
95% sequence identity to an amino acid sequence as set forth as SEQ ID NO:3, SEQ ID NO:5, SEQ
ID NO:8, SEQ ID NO:10, SEQ ID NO:12, SEQ ID NO:14, SEQ ID NO:17, SEQ ID NO:19, SEQ ID
NO:21, SEQ ID NO:23, SEQ ID NO:25, and SEQ ID NO:27, as are nucleic acid molecules encoding
these polypeptides.

In another embodiment, a method is provided for testing the efficacy of a drug in the
treatment of a condition associated with infection with JMHV. The method includes administering
the drug to a non-human primate infected with a JMHV; and observing the non-human primate to
determine if the drug prevents or reduces the presentation of one or more symptoms associated with
JMHV infection.

In another embodiment, a method is provided for testing the efficacy of a candidate vaccine
against JMHV infection, or conditions associated with JMHV infection. The method includes the
steps of: (a) administering the vaccine to a non-human primate susceptible to infection with the
JMHV; (b) inoculating the subject with the JMHV; and (c) observing the non-human primate to
determine if the vaccine prevents or reduces an incidence of JMHV infection or a symptom
associated with JMHV infection.

In a further embodiment, a method is provided for detecting the presence of JMHV or a
related virus in a biological specimen, by amplifying by polymerase chain reaction a JMHV nucleic
acid sequence, or by using hybridization technology, if such sequence is present in the sample.

In yet another embodiment, a method is provided for obtaining JMHV-related nucleic acid 
sequence using amplification or hybridization. 

A method is also provided for detecting the presence of JMHV in a biological specimen,
including contacting the biological specimen with an antibody that binds to a JMHV polypeptide, and
detecting binding of the antibody to the biological specimen or a component thereof. Binding of the
antibody to the biological specimen indicates the presence of JMHV.

Kits are provided that include an antibody that binds to a JMHV polypeptide or an 
oligonucleotide that hybridizes to a JMHV nucleic acid sequence. 

The foregoing and other objects, features, and advantages of the invention will become more
apparent from the following detailed description of several examples, which proceeds with reference
to the accompanying figures.

BRIEF DESCRIPTION OF THE FIGURES 
FIG. 1 shows a phylogenetic comparison of the gammaherpesviruses Epstein-Barr virus
(EBV), Alcelaphine herpesvirus (AHV), Murine herpesvirus (MHV), Herpesvirus saimiri (HVS),
Kaposi's sarcoma-associated herpesvirus (KSHV), and Rhesus rhadinovirus 17577 (RRV). It shows
that among the known sequenced viruses, RRV is the closest relative to KSHV, using an accepted
maximum parsimony method of evaluating evolutionary relationships.

FIG. 2 is a table showing the BamHI, EcoRI and HindIII restriction fragments of the RRV
genome.

FIG. 3 is a schematic map of the 75 ORFs of RRV. Arrow direction represents direction of 
transcription. 

FIG. 4 is a table showing the size, location and description (similarity to other proteins) of
the proteins encoded by the ORFs of RRV.

FIG. 5 is a table showing a comparison of corresponding repeats in RRV and KSHV.

FIG. 6 is a table showing the comparison of interferon regulatory elements encoded by RRV 
and KSHV. 

FIG. 7 is a table comparing the ORFs of RRV, KSHV and HVS. The table shows the start
and stop points, the strand (+ or -) from which the ORF is transcribed, the size of the ORFs and the
percentage similarity of KSHV and HVS when compared with RRV.

FIGS. 8A-8D are graphs showing CD20+ lymphocytes, antibody response and RhKSHV
isolation/detection in macaques infected with SIVmac239 and RRV, (A) 18483 and (B) 18570, and
macaques infected with SIVmac239 only, (C) 18503 and (D) 18540. [symbol] indicates positive for
virus culture or viral DNA, as defined by PCR and Southern blot analysis; [symbol] indicates negative
for virus culture or viral DNA.

FIG. 9 shows the result of the PCR analysis of PBLs and LNMCs from each of the
macaques (18483, 18503, 18540 and 18570) for RRV DNA and β-globin in (A) graphical form and
(B) digital form.

FIG. 10 shows the DNA sequence of the RRV ORF that encodes the MIP protein
(nucleotides 22245-22592 of SEQ ID NO: 1).

FIG. 11 shows the DNA sequence of the RRV ORF that encodes the IL-6 protein
(nucleotides 19921-20544 of SEQ ID NO: 1).


SEQUENCE LISTING 
The nucleic and amino acid sequences listed in the accompanying sequence listing are 
shown using standard letter abbreviations for nucleotide bases, and the code for amino acids. Only 
one strand of each nucleic acid sequence is shown, but the complementary strand is understood as
included by any reference to the displayed strand.
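
The listing gives ORF positions as nucleotide ranges within SEQ ID NO 1, some of them on the complement of the displayed strand. As a minimal sketch of how such a range could be read out of a sequence string, assuming 1-based inclusive coordinates and using hypothetical helper names and a toy sequence that are not part of this disclosure:

```python
# Illustrative sketch only. The helper names and the toy sequence are
# assumptions; coordinates are treated as 1-based and inclusive.
_COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")


def reverse_complement(seq: str) -> str:
    """Return the complementary strand, read 5' to 3'."""
    return seq.translate(_COMPLEMENT)[::-1]


def extract_region(genome: str, start: int, end: int, complement: bool = False) -> str:
    """Return nucleotides start..end; use complement=True for ranges
    described as 'the complement of nucleotides start-end'."""
    if not (1 <= start <= end <= len(genome)):
        raise ValueError("coordinates out of range")
    region = genome[start - 1:end]
    return reverse_complement(region) if complement else region


toy_genome = "ATGGCCTTAAGC"  # stand-in for a real genome sequence
forward = extract_region(toy_genome, 1, 6)
reverse = extract_region(toy_genome, 4, 9, complement=True)
```

With a real sequence loaded into `genome`, the same call pattern would apply to any range in the listing.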

SEQ ID NO 1 shows the nucleotide sequence of the RRV genome. RRV R1 corresponds to
nucleotides 1353-2624 of SEQ ID NO 1. RRV ORF 2 corresponds to the complement of nucleotides
2692-3258 of SEQ ID NO 1, which encodes dihydrofolate reductase, and which has some similarity
to Kaposi's sarcoma-associated herpesvirus (KSHV) ORF 2. RRV ORF 4 corresponds to nucleotides
3676-5613 of SEQ ID NO 1, which encodes complement binding protein, and which has some
similarity to KSHV ORF 4. RRV ORF 6 corresponds to nucleotides 6045-9443 of SEQ ID NO 1,
which encodes ssDNA binding protein, and which has some similarity to KSHV ORF 6. RRV ORF
7 corresponds to nucleotides 9468-11528 of SEQ ID NO 1, which encodes a transport protein, and
which has some similarity to KSHV ORF 7. RRV ORF 8 corresponds to nucleotides 11515-14004
of SEQ ID NO 1, which encodes glycoprotein B, and which has some similarity to KSHV ORF 8.
RRV ORF 9, DNA polymerase protein, corresponds to nucleotides 14122-17166 of SEQ ID NO 1,
which has some similarity to KSHV ORF 9. RRV ORF 10 corresponds to nucleotides 17261-18511
of SEQ ID NO 1, which has some similarity to KSHV ORF 10. RRV ORF 11 corresponds to
nucleotides 18520-19749 of SEQ ID NO 1, which has some similarity to KSHV ORF 11. RRV R2
corresponds to the complement of nucleotides 19921-20544 of SEQ ID NO 1, which has some
similarity to the Kaposi's sarcoma-associated IL-6 gene. RRV ORF 70, thymidylate synthase,
corresponds to the complement of nucleotides 20777-21778 of SEQ ID NO 1, and has some
similarity to KSHV ORF 70. RRV R3 corresponds to the complement of nucleotides 22245-22592
of SEQ ID NO 1, which has some similarity to the KSHV K4 viral MIP gene. RRV ORF 16, a Bcl-2
homolog, corresponds to nucleotides 26846-27409 of SEQ ID NO 1, which has some similarity to
KSHV ORF 16. RRV ORF 17 corresponds to the complement of nucleotides 27515-29125 of SEQ
ID NO 1, which has some similarity to KSHV ORF 17. RRV ORF 18 corresponds to nucleotides
28998-29897 of SEQ ID NO 1, which has some similarity to KSHV ORF 18.

RRV ORF 19 corresponds to the complement of nucleotides 29905-31548 of SEQ ID NO 1,
which encodes a tegument protein, and which has some similarity to KSHV ORF 19. RRV ORF 20
corresponds to the complement of nucleotides 31043-32095 of SEQ ID NO 1, and has some
similarity to KSHV ORF 20. RRV ORF 21 corresponds to nucleotides 32094-33767 of SEQ ID NO
1, which encodes a thymidine kinase protein, and which has some similarity to KSHV ORF 21. RRV
ORF 22 corresponds to nucleotides 33754-35868 of SEQ ID NO 1, which encodes a glycoprotein H
protein, and which has some similarity to KSHV ORF 22. RRV ORF 23 corresponds to the
complement of nucleotides 35865-37073 of SEQ ID NO 1, which has some similarity to KSHV ORF
23. RRV ORF 24 corresponds to the complement of nucleotides 37123-39321 of SEQ ID NO 1, and
has some similarity to KSHV ORF 24. RRV ORF 25 corresponds to nucleotides 39323-43459
of SEQ ID NO 1, which encodes a major capsid protein, and which has some similarity to
KSHV ORF 25. RRV ORF 26 corresponds to nucleotides 43491-4408- of SEQ ID NO 1, which
encodes a capsid protein, and which has some similarity to KSHV ORF 26. RRV ORF 27
corresponds to nucleotides 44433-45242 of SEQ ID NO 1, and has some similarity to KSHV
ORF 27. RRV ORF 28 corresponds to nucleotides 45408-45683 of SEQ ID NO 1, and has
some similarity to KSHV ORF 28. RRV ORF 29b corresponds to the complement of nucleotides
45733-46779 of SEQ ID NO 1, and has some similarity to KSHV ORF 29b. RRV ORF 30
corresponds to nucleotides 46905-47135 of SEQ ID NO 1, and has some similarity to KSHV
ORF 30. RRV ORF 31 corresponds to nucleotides 47093-47746 of SEQ ID NO 1, and has
some similarity to KSHV ORF 31. RRV ORF 32 corresponds to nucleotides 47683-49077 of SEQ
ID NO 1, and has some similarity to KSHV ORF 32. RRV ORF 33 corresponds to nucleotides
49049-50059 of SEQ ID NO 1, and has some similarity to KSHV ORF 33. RRV ORF 29a
corresponds to the complement of nucleotides 49977-50960 of SEQ ID NO 1, and has some
similarity to KSHV ORF 29a. RRV ORF 34 corresponds to nucleotides 50959-51942 of SEQ ID NO
1, and has some similarity to KSHV ORF 34. RRV ORF 35 corresponds to nucleotides 51923-52372
of SEQ ID NO 1, and has some similarity to KSHV ORF 35. RRV ORF 36 corresponds to nucleotides
52278-53585 of SEQ ID NO 1, which encodes a kinase, and which has some similarity to KSHV
ORF 36. RRV ORF 37 corresponds to nucleotides 53566-55008 of SEQ ID NO 1, which encodes an
alkaline exonuclease, and which has some similarity to KSHV ORF 37. RRV ORF 38 corresponds to
nucleotides 54963-55172 of SEQ ID NO 1, and has some similarity to KSHV ORF 38. RRV ORF
39 corresponds to the complement of nucleotides 55255-56391 of SEQ ID NO 1, which encodes
glycoprotein M, and which has some similarity to KSHV ORF 39. RRV ORF 40 corresponds to
nucleotides 56526-57932 of SEQ ID NO 1, which encodes helicase/primase, and which has some
similarity to KSHV ORF 40.



RRV ORF 41 corresponds to nucleotides 57917-58528 of SEQ ID NO 1, which encodes
helicase/primase, and which has some similarity to KSHV ORF 41. RRV ORF 42 corresponds to the
complement of nucleotides 58525-59343 of SEQ ID NO 1, which has some similarity to KSHV ORF
42. RRV ORF 43 corresponds to the complement of nucleotides 59297-61027 of SEQ ID NO 1,
which encodes a capsid protein, and which has some similarity to KSHV ORF 43. RRV ORF 44
corresponds to nucleotides 60966-63338 of SEQ ID NO 1, which encodes helicase/primase, and
which has some similarity to KSHV ORF 44. RRV ORF 45 corresponds to the complement of
nucleotides 63379-64437 of SEQ ID NO 1, and has some similarity to KSHV ORF 45. RRV
ORF 46 corresponds to the complement of nucleotides 64479-65246 of SEQ ID NO 1, which
encodes uracil DNA glycosidase, and which has some similarity to KSHV ORF 46. RRV ORF 47
corresponds to the complement of nucleotides 65222-65731 of SEQ ID NO 1, which encodes
glycoprotein L, and which has some similarity to KSHV ORF 47. RRV ORF 48 corresponds to the
complement of nucleotides 65999-67168 of SEQ ID NO 1, and has some similarity to KSHV
ORF 48. RRV ORF 49 corresponds to the complement of nucleotides 67398-68303 of SEQ ID NO
1, and has some similarity to KSHV ORF 49. RRV ORF 50 corresponds to nucleotides 68494-70038
of SEQ ID NO 1, which encodes a transactivator, and which has some similarity to KSHV
ORF 50. RRV R4 corresponds to nucleotides 70355-70888 of SEQ ID NO 1. RRV R5 corresponds
to nucleotides 71468-72160 of SEQ ID NO 1. RRV ORF 52 corresponds to the complement of
nucleotides 72401-72820 of SEQ ID NO 1, and has some similarity to KSHV ORF 52. RRV ORF 53
corresponds to the complement of nucleotides 72884-73198 of SEQ ID NO 1, and has some
similarity to KSHV ORF 53. RRV ORF 54 corresponds to nucleotides 73274-74146 of SEQ ID NO
1, which encodes a dUTPase, and which has some similarity to KSHV ORF 54. RRV ORF 55
corresponds to the complement of nucleotides 74207-74839 of SEQ ID NO 1, and has some
similarity to KSHV ORF 55. RRV ORF 56 corresponds to nucleotides 74851-77337 of SEQ ID NO
1, which encodes a DNA replication protein, and has some similarity to KSHV ORF 56. RRV ORF
57 corresponds to nucleotides 77578-78906 of SEQ ID NO 1, which encodes an immediate-early gene
product, and which has some similarity to KSHV ORF 57. RRV R6 corresponds to the complement
of nucleotides 79266-80513 of SEQ ID NO 1, which has some similarity to the KSHV vIRF K9 gene.
RRV R7 corresponds to the complement of nucleotides 80686-81933 of SEQ ID NO 1, which has
some similarity to the KSHV vIRF K9 gene. RRV R8 corresponds to the complement of nucleotides
82262-83317 of SEQ ID NO 1, which has some similarity to the KSHV vIRF K9 gene. RRV R9
corresponds to the complement of nucleotides 83491-84252 of SEQ ID NO 1, which has some
similarity to the KSHV vIRF K9 gene. RRV R10 corresponds to the complement of nucleotides
85052-86209 of SEQ ID NO 1, which has some similarity to the KSHV vIRF K9 gene. RRV R11
corresponds to the complement of nucleotides 86355-87527 of SEQ ID NO 1, which has some
similarity to the KSHV vIRF K9 gene. RRV R12 corresponds to the complement of nucleotides
87894-88961 of SEQ ID NO 1, which has some similarity to the KSHV vIRF K9 gene. RRV R13 corresponds
to the complement of nucleotides 89122-90216 of SEQ ID NO 1, which has some similarity to the KSHV
vIRF K9 gene. RRV ORF 58 corresponds to the complement of nucleotides 90462-91544 of SEQ ID
NO 1, which has some similarity to KSHV ORF 58. RRV ORF 59 corresponds to the complement of
nucleotides 91555-92739 of SEQ ID NO 1, which encodes a DNA replication protein, and which has
some similarity to KSHV ORF 59. RRV ORF 60 corresponds to the complement of nucleotides
92868-93812 of SEQ ID NO 1, which encodes a small ribonucleotide reductase protein, and which
has some similarity to KSHV ORF 60. RRV ORF 61 corresponds to the complement of nucleotides
93794-96160 of SEQ ID NO 1, which encodes a large ribonucleotide reductase protein, and which
has some similarity to KSHV ORF 61. RRV ORF 62 corresponds to the complement of nucleotides
96163-97158 of SEQ ID NO 1, which encodes an assembly/DNA maturation protein, and which has
some similarity to KSHV ORF 62. RRV ORF 63 corresponds to nucleotides 97157-99976 of SEQ
ID NO 1, which encodes a tegument protein, and which has some similarity to KSHV ORF 63. RRV
ORF 64 corresponds to nucleotides 99980-107626 of SEQ ID NO 1, which encodes a tegument
protein, and which has some similarity to KSHV ORF 64. RRV ORF 65 corresponds to the
complement of nucleotides 107637-108146 of SEQ ID NO 1, which encodes a capsid protein, and
which has some similarity to KSHV ORF 65. RRV ORF 66 corresponds to the complement of
nucleotides 108152-109498 of SEQ ID NO 1, which has some similarity to KSHV ORF 66. RRV
ORF 67 corresponds to the complement of nucleotides 109524-110198 of SEQ ID NO 1, which
encodes a tegument protein, and which has some similarity to KSHV ORF 67. RRV ORF 68
corresponds to nucleotides 110609-111982 of SEQ ID NO 1, which encodes a glycoprotein, and
which has some similarity to KSHV ORF 68. RRV ORF 69 corresponds to nucleotides 112004-112897
of SEQ ID NO 1, which has some similarity to KSHV ORF 69. RRV ORF 71 corresponds
to the complement of nucleotides 119211-119735 of SEQ ID NO 1, which encodes a FLIP protein,
and which has some similarity to KSHV ORF 71. RRV ORF 72 corresponds to the complement of
nucleotides 119794-120558 of SEQ ID NO 1, which encodes a cyclin D homolog, and which has
some similarity to KSHV ORF 72. RRV ORF 73 corresponds to the complement of nucleotides
120866-122212 of SEQ ID NO 1, which encodes a latent nuclear antigen, and which has some
similarity to KSHV ORF 73. RRV R15 corresponds to nucleotides 122866-123627 of SEQ ID NO 1,
which has some similarity to KSHV K14 and ox-2. RRV ORF 74 corresponds to nucleotides
123924-124952 of SEQ ID NO 1, which encodes a G protein coupled receptor, and which has some
similarity to KSHV ORF 74. RRV ORF 75 corresponds to the complement of nucleotides 125057-128953
of SEQ ID NO 1, which encodes a tegument protein, FGARAT, and which has some
similarity to KSHV ORF 75.

SEQ ID NO 2 shows the nucleic acid sequence of the JMHV ORF 21.

SEQ ID NO 3 shows the deduced amino acid sequence of the JMHV ORF 21.

SEQ ID NO 4 shows the nucleic acid sequence of another region of the JMHV ORF 21.

SEQ ID NO 5 shows the deduced amino acid sequence of a portion of JMHV ORF 21, as
encoded by SEQ ID NO:4.

SEQ ID NO 6 shows the nucleic acid sequence of a portion of JMHV cosmid 3 fragment 5
17, which has similarity to the nucleic acid sequence found in and to the right of RRV repeat unit
rDL-Bl.

SEQ ID NO 7 shows the nucleic acid sequence of the JMHV ORF 17, a capsid protein.

SEQ ID NO 8 shows the amino acid sequence of the JMHV ORF 17, deduced from SEQ ID
NO:7.

SEQ ID NO 9 shows the nucleic acid sequence of JMHV ORF 21 thymidine kinase, which
includes frameshift mutations.

SEQ ID NO 10 shows the amino acid sequence of a portion of JMHV ORF 21, which is the
deduced amino acid sequence of the region between the frameshifts in SEQ ID NO:9.

SEQ ID NO 11 shows the nucleic acid sequence of JMHV ORF 24.

SEQ ID NO 12 shows the deduced amino acid sequence of JMHV ORF 24.

SEQ ID NO 13 shows the nucleic acid sequence of JMHV ORF 7 transport protein.

SEQ ID NO 14 shows the deduced amino acid sequence of the JMHV ORF 7 transport
protein.

SEQ ID NO 15 shows the nucleic acid sequence of cosmid 3, fragment 1, sp6, which is
similar to the sequence to the left of RRV repeat rDL-BL

SEQ ID NO 16 shows the nucleic acid sequence of the JMHV ORF 10.

SEQ ID NO 17 shows the deduced amino acid sequence of the JMHV ORF 10.

SEQ ID NO 18 shows the nucleic acid sequence of the ORF 9 DNA polymerase.

SEQ ID NO 19 shows the deduced amino acid sequence of the ORF 9 DNA polymerase.

SEQ ID NO 20 shows the nucleic acid sequence of JMHV ORF 10.

SEQ ID NO 21 shows the deduced amino acid sequence of the JMHV ORF 10.

SEQ ID NO 22 shows the nucleic acid sequence of the JMHV ORF 8 glycoprotein B.

SEQ ID NO 23 shows the deduced amino acid sequence of the JMHV ORF 8 glycoprotein B.

SEQ ID NO 24 shows the nucleic acid sequence of the JMHV ORF 9 DNA polymerase.

SEQ ID NO 25 shows the deduced amino acid sequence of the JMHV ORF 9 DNA
polymerase.

SEQ ID NO 26 shows the nucleic acid sequence of the JMHV ORF 8 glycoprotein B.

SEQ ID NO 27 shows the deduced amino acid sequence of the JMHV ORF 8 glycoprotein B.


ATCC DEPOSITS

A Budapest Treaty deposit of RRV 17577 was made with the American Type Culture 
Collection (ATCC), Manassas, Virginia, on March 12, 1998, and has been accorded ATCC 
Accession No. VR-2601. 

A Budapest Treaty deposit of JMHV 17792 was made with the American Type Culture
Collection (ATCC), Manassas, Virginia, on May 18, 2000, and has been accorded ATCC Accession
No. PTA-1884.



DETAILED DESCRIPTION OF SEVERAL EMBODIMENTS 
Abbreviations and Definitions

Animal: Living multicellular vertebrate organisms, a category which includes, for example, 
humans, non-human primates, mammals, and birds. 

Cell: A plant, animal, insect, bacterial, or fungal cell. 

Homologs: Two nucleotide or amino acid sequences that share a common ancestral
sequence and diverged when a species carrying that ancestral sequence split into two species.
Homologs frequently show a substantial degree of sequence identity.
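
Sequence identity between two sequences is commonly reported as a percentage. The passage above does not state how that percentage is computed, so the following is only a sketch under stated assumptions: the two sequences are already aligned to equal length, gaps are written as "-", and identity is counted over all aligned columns. The function name and the toy sequences are hypothetical.

```python
# Illustrative sketch only. Assumes the inputs are pre-aligned to equal
# length with '-' as the gap character; the function name is hypothetical.
def percent_identity(aligned_a: str, aligned_b: str, gap: str = "-") -> float:
    """Percentage of aligned columns in which both residues are identical."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    # Ignore columns where both sequences carry a gap.
    columns = [(a, b) for a, b in zip(aligned_a, aligned_b)
               if not (a == gap and b == gap)]
    if not columns:
        raise ValueError("no aligned columns to compare")
    matches = sum(1 for a, b in columns if a == b and a != gap)
    return 100.0 * matches / len(columns)


# Toy example: 6 identical positions out of 8 aligned columns.
identity = percent_identity("MKT-LLVA", "MKTALLIA")
```

Other conventions exist, for example dividing by the length of the shorter sequence, and they give different percentages for the same pair, so the denominator should be stated whenever a threshold is applied.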

IL-6: Interleukin 6. IL-6 is a cytokine known to have pleiotropic immunological effects
including anti-inflammatory and immunosuppressive effects (Human Cytokines, 1991, pg. 142-167,
Blackwell Scientific Publications, Aggarwal and Gutterman, eds). Because IL-6 is a pleiotropic
cytokine, IL-6 activity may be measured using a number of bioassays, including stimulation of
immunoglobulin production in SKW6-CL4 cells as described by Hirano et al. (Nature 324:73-6,
1986) and stimulation of hybridoma cell growth as described by Matsuda et al., 1988, Eur. J.
Immunol. 18:951-956, both of which are incorporated by reference. As used herein, the term "IL-6
biological activity" refers to the ability of a protein to show activity in at least one of these assay
systems.

Immunocompromised: Lacking a normal immune response. Immuno-compromised
refers to a condition in which some or all of an animal's immune system is inoperative, leaving the
animal with an increased susceptibility to infection or disease. An animal may be rendered
immuno-compromised by a biological agent such as, in the case of non-human primates, Simian
Immunodeficiency Virus (SIV). Many strains of SIV have been isolated and characterized; any SIV
strain that produces an immuno-compromised state can be used in the present invention including,
but not limited to, for example, SIVmac239 (Kestler et al., 1990, Science 248:1109-12), SIVmac251
(Daniels et al., 1985, Science 228:1201-4), SIVdeltaB670 (Murphy-Corb et al., 1986, Nature
321:435) and SIVmne (Benveniste et al., 1988, J. Virol. 62:2091-101). In addition, hybrid SIV/HIV
chimeras as known in the field can be employed, as can HIV-2. Simian type D retroviruses (SRVs),
which cause an AIDS-like disease in rhesus monkeys, can alternatively be used to
immuno-compromise the animals in place of SIV. These viral agents are administered to the animal using
conventional means, such as intravenous or intramuscular injection, or oral, intrarectal or intravaginal
inoculation (also see Example 24). Either intact viral particles or viral DNA may be administered.
As known in the field, plasmid constructs containing the entire SIV genome are infectious when
inoculated into animals and so may be employed in place of purified viral DNA.

Alternatively, an animal may be rendered immuno-compromised by administration of agents
that target the immune system, including but not limited to anti-CD3 antibody (CD3 being the T-cell
receptor) either alone or conjugated with a toxic moiety, or immunosuppressive compounds including
prednisone, azathioprine, cyclosporine A, and cyclophosphamide. Where an immunosuppressive
compound such as cyclosporine is employed, an allogenic stimulus (such as a blood transfusion) may
be administered with the subsequent administration of RRV to activate infection.

Alternatively, other methods of rendering an animal immuno-compromised may be used,
including radiation treatment and surgical intervention.

Isolated: An "isolated" biological component (such as a nucleic acid, peptide or protein)
has been substantially separated, produced apart from, or purified away from other biological
components in the cell of the organism in which the component naturally occurs, i.e., other
chromosomal and extrachromosomal DNA and RNA, and proteins. Nucleic acids, peptides and
proteins which have been "isolated" thus include nucleic acids and proteins purified by standard
purification methods. The term also embraces nucleic acids, peptides and proteins prepared by
recombinant expression in a host cell as well as chemically synthesized nucleic acids.

JMHV: A virus having the virological, immunological or pathological characteristics of
Japanese Macaque Herpesvirus (e.g. JMHV 17792). JMHV causes the symptoms of MS in Japanese
macaque monkeys which are infected with the virus. In particular examples, the JMHV has at least
85% (for example at least 90%, 95% or 98%) sequence identity to the virus
deposited as ATCC Accession Number PTA-1884. In other examples, the JMHV has at least 90%,
91%, 92%, 93%, 94%, 95%, or 98% homology with an RRV (e.g. ATCC VR-2901). Without being
bound by theory, the JMHV may be derived from an RRV (see the experimental examples).

JMHV 17792: Japanese macaque herpesvirus isolate. A Budapest Treaty deposit of
JMHV 17792 was made with the American Type Culture Collection, Manassas, Virginia on May 18,
2001. This virus may be grown on primary Japanese macaque fibroblasts, using standard virological
techniques. Alternatively, it may be grown on commercially available macaque cell lines. Infection
of a non-human primate with JMHV 17792 can be accomplished using any standard method,
including intravenous injection. In one embodiment, infection is achieved using 10^6 plaque forming
units of JMHV 17792.

KSHV: Kaposi's sarcoma-associated herpesvirus. KSHV is a herpesvirus associated with
(and thought to be the etiological agent of) Kaposi's sarcoma in humans.

Lymphoproliferative Disorder: A group of disorders characterized by proliferation of
lymphoid tissue, such as lymphocytic leukemia and malignant lymphoma, and characterized by such
signs as lymphocytosis, lymphadenopathy, and splenomegaly.

MIP: Macrophage inflammatory protein. The acronym MIP is used to describe a family of
cytokines that includes MIP1 (Davatelis et al., 1989, Science 243:1066-8) and MIP2 (U.S. Patent No.
5,145,676). MIPs mediate pleiotropic immunological effects including activation of neutrophils to
undergo an oxidative burst. MIPs are also intrinsically pyrogenic. MIP biological activity can be
detected and quantified using bioassays as described in Kledal et al. (Science 277:1656-9, 1997) and
Boshoff et al. (Science 278:290-4, 1997) that measure MIP concentrations using HIV inhibition and
calcium mobilization, respectively. As used herein, the term "MIP biological activity" refers to the
ability of a protein to show activity in at least one of these assay systems.

Multiple Sclerosis: A chronic, progressive disease of the central nervous system.
Currently, the exact cause of the disease is unknown and there is no cure. Multiple sclerosis refers to
multiple areas of patchy scarring, or plaques, that result from demyelination (destruction of myelin, a
fatty insulation covering the nerve fibers). When the myelin sheath is destroyed during the MS
disease process, signals transmitted throughout the CNS are slowed or disrupted. In many cases, the
body may compensate for the loss of myelin by increasing the density of the sodium channels so that
action potentials can continue to be carried, in spite of loss of myelin. The nerves also retain the
capacity to remyelinate. Unfortunately, the disease process often outpaces these corrective actions.

The symptoms, severity, and course of MS vary widely depending partly on the sites of the
plaques and the extent of the demyelination. Experts generally group multiple sclerosis into four
types: relapsing-remitting, primary-progressive, secondary-progressive, and progressive-relapsing
MS. Relapsing-remitting multiple sclerosis generally occurs in younger people and is the most
common form of MS. Symptoms flare up for several days and then go into remission over the next
four to eight weeks. The latter three forms (primary-progressive, secondary-progressive, and
progressive-relapsing MS) generally fall under the category of chronic-progressive MS. In
chronic-progressive MS the symptoms of the disease continue to worsen slowly without remission. About
20% of multiple sclerosis patients (usually those patients whose first symptoms occur after age 45)
have the chronic-progressive form of MS without first developing relapsing-remitting MS.
Chronic-progressive MS may lead to serious speech problems and paralysis, and generally the symptoms
continue to worsen over time.

Multiple sclerosis is defined as an autoimmune disease; that is, the body's immune system is
damaged by genetic or environmental factors or both, causing it to attack its own tissues. In the case
of MS, these tissues are the myelin covering the nerve fibers in the brain.

Although the exact agent that triggers MS is unknown, it has been proposed that viral
antigens could trigger the autoimmune response in MS. One factor that seems to support this theory
is the geographical distribution of the disease; the number of MS cases increases the further one gets
from the equator in either direction. Epidemics of multiple sclerosis also seem to occur. For
example, four separate MS epidemics happened between 1943 and 1989 in the Faroe Islands, located
between Iceland and Scandinavia. During World War II, this region was occupied by British troops;
each year for 20 years after the war, the incidence of MS increased, leading some researchers to think
that the troops might have brought with them some disease-causing agent.

Although several viruses have been associated with MS, no one causative agent has been
identified. It has been proposed that HHV-6, a form of herpesvirus, may be the causative agent.
However, other viruses have been implicated in the disease process including herpesviruses 1 and 2,
varicella-zoster virus, cytomegalovirus, Epstein-Barr virus, measles virus, adenovirus, polyomavirus,
and retroviruses, including HIV, HTLV-I, and HTLV-II. It has also been proposed that bacteria may
be the causative agent. In one recent study, 17 patients who were admitted to a hospital and
diagnosed with MS showed signs of infection with the microorganism Chlamydia pneumoniae, an
atypical bacterium. However, establishing an association of any of these agents with MS is difficult, as
infection in the general population is common.

Genetic factors are also believed to have a role in the development of MS. Children of MS
patients have an increased risk 30 to 50 times that of the normal population. In addition, the odds of
a sibling of an MS patient having the disease are about 20 times higher than in the normal population. A
significant association between siblings with MS and the specific form of the disease, either
relapsing-remitting or chronic-progressive, has been found. It is possible that a set of genes that
somehow interact are responsible for the inherited risks. In addition, genetic variations that occur in
different ethnic populations are also believed to result in a variable disease course amongst the
different ethnic groups.

A symptom of MS is any physical characteristic associated with the disease process. For
example, the first symptom of MS is often optic neuritis, the inflammation of the optic nerve. Vision,
usually in one eye, becomes unclear or doubled, and there may be a shimmering effect. Pain or
nystagmus, involuntary jerking or movement of the eye, may also occur. In 20% of people with this
condition, MS develops within two years; in 45% to 80% it develops within 15 years. Other early
symptoms of multiple sclerosis include fatigue, heaviness or clumsiness in the arms and legs, tingling
sensations, and poor coordination. Another indication of MS is a reaction known as Lhermitte's sign,
whereby bending the neck produces an electrical sensation that runs down the back and into the legs.

As the disease develops over months or even years, other symptoms may include spasticity,
imbalance, tremors, incontinence, constipation, sexual dysfunction, hearing loss, vertigo, facial pain,
and difficulties in swallowing. Problems in speech may occur because of difficulty in controlling the
quality of the voice and articulating words. About half of patients display changes in mental
function, including problems in concentration and problem solving. In about 10% of cases, there is
severe mental dysfunction which resembles dementia. One of the primary symptoms of MS is
spasticity, which is characterized by weakness, loss of dexterity, and the inability to control specific
movements.

In general, MS itself is not fatal, although people who have this disease generally have a
shorter life span than average. However, the negative emotional impact of this disease and its
symptoms is considerable. The severity of the disease varies widely from patient to patient; MS
sometimes remains asymptomatic or becomes only mildly symptomatic even long after initial plaque
formation. About 20% to 35% of patients have a very mild form of the disease, with little if any
disability, no need for medication, and a normal life expectancy. At the other extreme, between 3%
and 12% of patients have a very serious, rapidly progressive form of MS. Most patients fall
somewhere in the middle. Life-threatening complications may occur as a result of infections in the
lungs or kidneys. As the muscles that control breathing weaken, the ability to cough is impaired and
the patient is at higher risk for pneumonia and other complications in the lungs. Complications in the
urinary tract also place the patient at risk for kidney infections. In very severe cases of MS, paralysis
may occur. The severity of disability incurred over a lifetime is unpredictable.

About 1.1 million people worldwide have multiple sclerosis, and the incidence appears to be
increasing. Between 250,000 and 350,000 Americans have the disease. Onset of symptoms typically
occurs between the ages of 15 and 40 years, with a peak incidence in people in their 20s and 30s.
Women are affected twice as often as men. Multiple sclerosis occurs worldwide but is most common
in Caucasian people of northern European origin, especially those of Scottish descent. It is extremely
rare among Asians and Africans. In general, MS is more prevalent in temperate regions of the world
than in the tropics. It is unclear whether this pattern is attributable to environmental factors or to
genetics. A family history of the disease also puts people at risk, although the risk for someone
inheriting all the genetic factors contributing to MS is only about 2% to 4%.

Myelin: A coating of nerve cells (neurons) made from layers of cell membranes that are
produced in the brain and spinal cord by specialized cells called oligodendrocytes. Myelin coats lie
in segments along the axons, the long filaments that carry electric impulses away from a nerve cell.
The segments are separated from each other by tiny clusters called nodes of Ranvier, which house
channels for sodium ions. These sodium ions are important for boosting the electrical charge
required to pass signals from one nerve to another.

Non-human primate: Simian primates including chimpanzees, orangutans, baboons, and
macaques. Any non-human primate may be used to produce a KSHV-disease animal model or a
JMHV-disease animal model by the methods disclosed herein. Thus, in addition to the rhesus
macaque and Japanese macaque models described in detail below, pigtail and cynomolgus
macaques and baboons may also be used to produce KSHV-disease or JMHV-disease animal models
by the methods disclosed herein.

Oligonucleotide: A linear polynucleotide sequence of up to about 200 nucleotide bases in 
length, for example a polynucleotide (such as DNA or RNA) which is at least 6 nucleotides, for 
example at least 15, 25, 50, 100 or even 200 nucleotides long.

Operably linked: A first nucleic acid sequence is operably linked with a second nucleic 
acid sequence when the first nucleic acid sequence is placed in a functional relationship with the 
second nucleic acid sequence. For instance, a promoter is operably linked to a coding sequence if the 
promoter affects the transcription or expression of the coding sequence. Generally, operably linked
DNA sequences are contiguous and, where necessary to join two protein coding regions, in the same
reading frame. 

ORF: open reading frame. Contains a series of nucleotide triplets (codons) coding for
amino acids without any termination codons. These sequences are usually translatable into protein.
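
By way of a minimal, non-limiting sketch, the ORF definition above can be expressed in Python as a scan of the three forward reading frames. The function name find_orfs, the min_codons parameter, the requirement of an ATG start codon, and the use of the standard TAA, TAG and TGA termination codons are assumptions made for this illustration and are not taken from the disclosure.

```python
# Assumption: standard termination codons of the universal genetic code.
STOP_CODONS = {"TAA", "TAG", "TGA"}


def find_orfs(dna, min_codons=2):
    """Return (start, end, frame) for each ATG-initiated run of codons that
    is closed by a termination codon, scanning the three forward frames.

    start and end are 0-based slice indices; end includes the stop codon.
    """
    dna = dna.upper()
    orfs = []
    for frame in range(3):
        start = None
        # Step through the sequence one codon at a time in this frame.
        for i in range(frame, len(dna) - 2, 3):
            codon = dna[i:i + 3]
            if start is None and codon == "ATG":
                start = i  # open a candidate reading frame
            elif start is not None and codon in STOP_CODONS:
                # Number of codons before the stop, including the ATG.
                if (i - start) // 3 >= min_codons:
                    orfs.append((start, i + 3, frame))
                start = None  # close the frame and keep scanning
    return orfs
```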

PCR: polymerase chain reaction. Describes a technique in which cycles of denaturation,
annealing with primer, and then extension with DNA polymerase are used to amplify the number of
copies of a target DNA sequence.
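
As an idealized illustration of why repeated cycling amplifies a target, the copy number after n cycles can be modeled as N0 x (1 + E)^n, where E is the per-cycle efficiency (E = 1 corresponds to perfect doubling). This model, the function name, and the efficiency parameter are assumptions for the sketch below; real reactions plateau, so the model is not expected to hold at high cycle numbers.

```python
def copies_after_cycles(initial_copies, cycles, efficiency=1.0):
    """Idealized copy number after a number of PCR cycles.

    efficiency is the assumed fraction of templates copied per cycle
    (1.0 means every template is duplicated in every cycle).
    """
    if cycles < 0:
        raise ValueError("cycles must be non-negative")
    if not 0.0 <= efficiency <= 1.0:
        raise ValueError("efficiency must be between 0 and 1")
    # Each cycle multiplies the template count by (1 + efficiency).
    return initial_copies * (1.0 + efficiency) ** cycles
```

Under the perfect-doubling assumption, a single template molecule would yield 2^30 copies after 30 cycles; lower assumed efficiencies reduce this figure substantially.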

Pharmaceutically acceptable carriers: The pharmaceutically acceptable carriers useful in
this invention include conventional carriers. Remington's Pharmaceutical Sciences, by E. W. Martin,
Mack Publishing Co., Easton, PA, 15th Edition (1975), describes compositions and formulations
suitable for pharmaceutical delivery of the viruses, nucleic acids and/or proteins herein disclosed.

In general, the nature of the carrier will depend on the particular mode of administration
being employed. For instance, parenteral formulations usually comprise injectable fluids that include
pharmaceutically and physiologically acceptable fluids such as water, physiological saline, balanced
salt solutions, aqueous dextrose, glycerol, ethanol, combinations thereof, or the like, as a vehicle.
The carrier and composition can be sterile, and the formulation suits the mode of administration. For
solid compositions (e.g., powder, pill, tablet, or capsule forms), conventional non-toxic solid carriers
can include, for example, pharmaceutical grades of mannitol, lactose, starch, sodium saccharine,
cellulose, magnesium carbonate, or magnesium stearate. In addition to biologically-neutral carriers,
pharmaceutical compositions to be administered can contain minor amounts of non-toxic auxiliary
substances, such as wetting or emulsifying agents, preservatives, and pH buffering agents and the
like, for example sodium acetate or sorbitan monolaurate.

The composition can be a liquid solution, suspension, emulsion, tablet, pill, capsule,
sustained release formulation, or powder. The composition can be formulated as a suppository, with
traditional binders and carriers such as triglycerides.

Probes and primers: Nucleic acid probes and primers may readily be prepared based on
the amino acid sequences provided by this invention. A probe is an isolated nucleic acid attached to a
detectable label or reporter molecule. Typical labels include radioactive isotopes, ligands, fluorescent
molecules, chemiluminescent molecules, and enzymes. In other embodiments, labels include
co-factors, enzyme substrates, and haptens.

Methods for labeling and guidance in the choice of labels appropriate for various purposes are
discussed, e.g., in Sambrook et al., in Molecular Cloning: A Laboratory Manual, Cold Spring Harbor (1989)
and Ausubel et al., in Current Protocols in Molecular Biology, Greene Publishing Associates and
Wiley-Intersciences (1987).

Primers are short nucleic acids, such as DNA oligonucleotides 10 nucleotides or more in
length. Primers may be annealed to a complementary target DNA strand by nucleic acid
hybridization to form a hybrid between the primer and the target DNA strand, and then extended
along the target DNA strand by a DNA polymerase enzyme. Primer pairs can be used for
amplification of a nucleic acid sequence, e.g., by the polymerase chain reaction (PCR) or other
nucleic-acid amplification methods known in the art.

Methods for preparing and using probes and primers are described, for example, in
Sambrook et al. (Molecular Cloning: A Laboratory Manual, 2nd ed., vol. 1-3, Cold Spring Harbor,
New York, 1989); Ausubel et al. (Current Protocols in Molecular Biology, Greene Publ. Assoc. &
Wiley-Intersciences, 1987) and Innis et al. (PCR Protocols, A Guide to Methods and
Applications, 1990, Innis et al. (eds.), 21-27, Academic Press, Inc., San Diego, California). PCR
primer pairs can be derived from a known sequence, for example, by using computer programs
intended for that purpose such as Primer (Version 0.5, © 1991, Whitehead Institute for Biomedical
Research, Cambridge, MA).

Probes and primers as used in the present invention typically comprise at least 15 contiguous
nucleotides of the RRV genome sequence (SEQ ID NO 1), or 15 contiguous nucleotides of a JMHV
sequence. One of skill in the art will appreciate that the specificity of a particular probe or primer
increases with its length. Thus, for example, a primer comprising 20 consecutive nucleotides will
anneal to a target with a higher specificity than a corresponding primer of only 15 nucleotides. Thus,
in order to obtain greater specificity, probes and primers may be selected that comprise 20, 25, 30, 35,
40, 50 or more consecutive nucleotides. In order to enhance specificity, longer probes and primers
may also be employed, such as probes and primers that comprise at least 20, 30, 40, 50, 60, 70, 80,
90, 100, or 150 consecutive nucleotides of the disclosed nucleic acid sequences.
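
The effect of length on specificity can be sketched with a simple probability estimate. Assuming, purely for illustration, a random target sequence with equal base frequencies, a primer of n nucleotides is expected to find about S / 4^n perfect matches by chance among S candidate sites. The function name, the two-strand default, and any target length supplied by the user are hypothetical and are not values from this disclosure.

```python
def expected_chance_matches(primer_len, target_len, strands=2):
    """Expected number of perfect chance matches of a primer in a random target.

    Assumes independent, equally frequent bases (an idealization).
    """
    if primer_len <= 0:
        raise ValueError("primer_len must be positive")
    # Number of positions at which the primer could start, summed over strands.
    sites = max(target_len - primer_len + 1, 0) * strands
    # Each position matches the primer with probability (1/4) ** primer_len.
    return sites / 4 ** primer_len
```

Under these assumptions each added nucleotide lowers the expected number of chance matches about fourfold, which is consistent with the preference stated above for longer probes and primers when greater specificity is desired.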

Alternatively, such probes and primers may comprise at least 15, 20, 30, 40, 50, 60, 70, 80,
90, 100, or 150 consecutive nucleotides that share a defined level of sequence identity with the
disclosed RRV or JMHV sequence, for instance, at least a 60%, 70%, 80%, 90%, 95% or 98%
sequence identity. Alternatively, such probes and primers may be nucleotide molecules which
hybridize under wash conditions of 70°C and about 0.2 x SSC for 1 hour, or alternatively under less
stringent conditions of 65°C, 60°C, or 55°C with from about 0.2 to 2 x SSC (with, for instance, about
0.1% SDS) for 1 hour with a portion of the RRV sequence.

Purified: The term purified does not require absolute purity; rather, it is intended as a 
relative term. Thus, for example, a purified peptide preparation is one in which the peptide or protein 
is more enriched than the peptide or protein is in its natural environment within a cell. Preferably, a 
preparation is purified such that the protein or peptide represents at least 50% of the total peptide or 
protein content of the preparation.

Recombinant: A recombinant nucleic acid is one that has a sequence that is not naturally
occurring or has a sequence that is made by an artificial combination of two otherwise separated
segments of sequence. This artificial combination is often accomplished by chemical synthesis or,
more commonly, by the artificial manipulation of isolated segments of nucleic acids, e.g., by genetic
engineering techniques.

RRV 17577: Rhesus macaque rhadinovirus RRV isolate 17577. A Budapest Treaty deposit
of RRV 17577 was made with the American Type Culture Collection, Manassas, Virginia, on March
12, 1998, and has been accorded ATCC Accession No. VR-2601. This virus may be grown on
primary rhesus fibroblasts, as described below (see Examples 1 and 14), using standard virological
techniques. Alternatively, it may be grown on commercially available rhesus cell lines, including
those available from ATCC, such as ATCC CRL-6306 and ATCC CL-160. Infection of a
non-human primate with RRV 17577 may be accomplished using any standard method, including
intravenous injection (see Examples 13, 23 and 24). Typically, infection is achieved by intravenous
injection of around 10^6 plaque forming units (PFUs) of RRV 17577.

RRV: A virus having the virological and immunological characteristics of RRV 17577, and
which causes Kaposi's sarcoma in immunocompromised rhesus monkeys which are infected with
the virus. In particular examples, the RRV has at least 85% (for example at least 90%, 95% or 98%)
sequence identity to SEQ ID NO 1.

Sequence Identity: The similarity between two nucleic acid sequences, or two amino acid
sequences, is expressed in terms of the similarity between the sequences, otherwise referred to as
sequence identity. Sequence identity is frequently measured in terms of percentage identity (or
similarity or homology); the higher the percentage, the more similar the two sequences are.
Homologs or orthologs of nucleic acid or amino acid sequences will possess a relatively high degree
of sequence identity when aligned using standard methods. This homology will be more significant
when the orthologous proteins or nucleic acids are derived from species which are more closely
related (e.g., human and chimpanzee sequences), compared to species more distantly related (e.g.,
human and C. elegans sequences). Typically, orthologs are at least 50% identical at the nucleotide
level and at least 50% identical at the amino acid level when comparing human orthologous
sequences.
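
As a simplified illustration of the percentage identity described above, the following Python sketch counts matching columns in two sequences that have already been aligned. It is not the calculation performed by any particular alignment program; the function name, the use of "-" as the gap character, and the choice to count gapped columns in the denominator are assumptions for this example.

```python
def percent_identity(aligned_a, aligned_b, gap="-"):
    """Percentage of alignment columns in which both sequences carry the
    same residue. Inputs must be pre-aligned and of equal length."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have the same length")
    # Ignore columns in which both sequences have a gap.
    columns = [(a, b) for a, b in zip(aligned_a.upper(), aligned_b.upper())
               if not (a == gap and b == gap)]
    if not columns:
        raise ValueError("no comparable columns")
    # A column is a match only if the residues are equal and not gaps.
    matches = sum(1 for a, b in columns if a == b and a != gap)
    return 100.0 * matches / len(columns)
```

With this convention, two aligned sequences of six columns that agree in four of them would be reported as about 66.7% identical; other conventions (for example, excluding gapped columns) give different figures for the same alignment.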

Methods of alignment of sequences for comparison are well known. Various programs and
alignment algorithms are described in: Smith & Waterman, Adv. Appl. Math. 2:482, 1981;
Needleman & Wunsch, J. Mol. Biol. 48:443, 1970; Pearson & Lipman, Proc. Natl. Acad. Sci. USA
85:2444, 1988; Higgins & Sharp, Gene, 73:237-44, 1988; Higgins & Sharp, CABIOS 5:151-3, 1989;
Corpet et al., Nuc. Acids Res. 16:10881-90, 1988; Huang et al., Computer Appls. Biosci. 8:155-65,
1992; and Pearson et al., Meth. Mol. Bio. 24:307-31, 1994. Altschul et al., J. Mol. Biol. 215:403-10,
1990, presents a detailed consideration of sequence alignment methods and homology calculations.

The NCBI Basic Local Alignment Search Tool (BLAST) (Altschul et al., J. Mol. Biol.
215:403-10, 1990) is available from several sources, including the National Center for Biotechnology
Information (NCBI, Bethesda, MD) and on the Internet, for use in connection with the sequence
analysis programs blastp, blastn, blastx, tblastn and tblastx. Each of these sources also provides a
description of how to determine sequence identity using this program.

Homologous sequences are typically characterized by possession of at least 60%, 70%, 75%,
80%, 90%, 95% or at least 98% sequence identity counted over the full length alignment with a
sequence using the NCBI Blast 2.0, gapped blastp set to default parameters. Queries searched with
the blastn program are filtered with DUST (Hancock and Armstrong, Comput. Appl. Biosci. 10:67-70,
1994). It will be appreciated that these sequence identity ranges are provided for guidance only; it is
entirely possible that strongly significant homologs could be obtained that fall outside of the ranges
provided.

Nucleic acid sequences that do not show a high degree of identity may nevertheless encode 
similar amino acid sequences, due to the degeneracy of the genetic code. It is understood that 
changes in nucleic acid sequence can be made using this degeneracy to produce multiple nucleic acid 
sequences that all encode substantially the same protein.
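
To illustrate the extent of this degeneracy, the sketch below counts how many distinct nucleic acid sequences could encode a given peptide under the standard genetic code. The function name and any example peptides are hypothetical; the per-residue codon counts are those of the standard code (61 sense codons in total) and would differ for organisms using other codes.

```python
from math import prod

# Number of codons specifying each amino acid in the standard genetic code
# (one-letter codes; the 61 sense codons are distributed as shown).
CODONS_PER_RESIDUE = {
    "A": 4, "R": 6, "N": 2, "D": 2, "C": 2, "Q": 2, "E": 2, "G": 4,
    "H": 2, "I": 3, "L": 6, "K": 2, "M": 1, "F": 2, "P": 4, "S": 6,
    "T": 4, "W": 1, "Y": 2, "V": 4,
}


def synonymous_sequence_count(peptide):
    """Number of distinct coding sequences that encode the given peptide."""
    try:
        return prod(CODONS_PER_RESIDUE[residue] for residue in peptide.upper())
    except KeyError as err:
        raise ValueError(f"unknown residue: {err.args[0]}") from None
```

For instance, a dipeptide of leucine and serine (six codons each) can be encoded by 36 different hexanucleotides, whereas methionine followed by tryptophan can be encoded by only one.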

One indication that two nucleic acid sequences are substantially identical is that the 
polypeptide encoded by the first nucleic acid is immunologically cross reactive with the polypeptide 
encoded by the second nucleic acid. 

An alternative indication that two nucleic acid molecules are closely related is that the two 
molecules hybridize to each other under stringent conditions, as described under "specific
hybridization." 

Homologs of the disclosed RRV or JMHV nucleic acids typically possess at least 50%
sequence identity counted over the length of one of the nucleic acids (the reference nucleic acid)
using the NCBI Blast 2.0.6, gapped blastn set to default parameters. Nucleic acids showing
substantial similarity when assessed by this method may show, for example, at least 50%, 60%, 70%,
80%, 90%, 95% or even 98% or greater sequence identity. When less than the entire sequence is
being compared for sequence identity, substantially similar nucleotide sequences will typically
possess at least 70% sequence identity over short windows of 30-90 nucleotides, and may possess
sequence identities of at least 80%, 90%, 95% or 98% or greater.

Homologs of the disclosed RRV or JMHV proteins typically possess at least 50% sequence
identity counted over full-length alignment with the amino acid sequence of RRV using the NCBI
Blast 2.0, gapped blastp set to default parameters. For comparisons of amino acid sequences of
greater than about 30 amino acids, the Blast 2 sequences function is employed using the default
BLOSUM62 matrix set to default parameters (gap existence cost of 11, and a per residue gap cost of
1). When aligning short peptides (fewer than around 30 amino acids), the alignment should be
performed using the Blast 2 sequences function, employing the PAM30 matrix set to default
parameters (open gap 9, extension gap 1 penalties). Proteins with even greater similarity to the
reference sequence will show increasing percentage identities when assessed by this method, such as
at least 50%, at least 55%, at least 60%, at least 70%, at least 75%, at least 80%, at least 90%, at least
95%, at least 98%, or at least 99% sequence identity. When less than the entire sequence is being
compared for sequence identity, homologs will typically possess at least 70% sequence identity over
short windows of 10-20 amino acids, and may possess sequence identities of at least 75%, at least
85% or at least 90%, at least 95% or 98% depending on their similarity to the reference sequence.
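
The parameter choices recited above for amino acid comparisons can be collected into a small lookup, sketched here in Python. The function name, the dictionary keys, and the treatment of a sequence of exactly 30 amino acids as a long sequence are assumptions for illustration (the text gives the cutoff only as about 30 amino acids); the sketch merely returns the settings and does not itself run an alignment.

```python
def protein_alignment_settings(length, cutoff=30):
    """Return the scoring matrix and gap costs recited in the text for an
    amino acid sequence of the given length."""
    if length <= 0:
        raise ValueError("length must be positive")
    if length >= cutoff:
        # Longer sequences: BLOSUM62, gap existence cost 11, per-residue cost 1.
        return {"matrix": "BLOSUM62", "gap_open": 11, "gap_extend": 1}
    # Short peptides: PAM30, open gap penalty 9, extension gap penalty 1.
    return {"matrix": "PAM30", "gap_open": 9, "gap_extend": 1}
```

A 12-residue peptide would thus be compared with the PAM30 settings and a 120-residue protein with the BLOSUM62 settings; the cutoff argument can be adjusted where a different reading of "about 30" is preferred.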

When comparing degrees of sequence identity between similar proteins, the degree of
identity will be equal to or less than the degree of similarity, due to the fact that similarity takes
into account conservative amino acid substitutions. So, for instance, the degree of sequence identity
between two substantially similar proteins may be at least 50%, 55%, 65%, 75%, 85%, 95%, 98% or
more.

One of ordinary skill in the art will appreciate that these sequence identity ranges are
provided for guidance only; it is entirely possible that strongly significant homologs could be
obtained that fall outside of the ranges provided. The present invention provides not only the peptide
homologs that are described above, but also nucleic acid molecules that encode such homologs.

An alternative indication that two nucleic acid molecules are closely related is that the two
molecules hybridize to each other under stringent conditions, as described in Example 23.

Specific binding agent: An agent that binds substantially only to a defined target. As used
herein, the term "RRV peptide specific binding agent" includes anti-RRV peptide antibodies and
other agents that bind substantially only to the RRV peptide. Such "peptide specific binding agents"
include anti-IL-6 and anti-MIP antibodies. Similarly, the term "JMHV peptide specific binding
agent" includes anti-JMHV peptide antibodies and other agents that bind substantially only to the
JMHV peptide. The antibodies may be monoclonal or polyclonal antibodies that are specific for an
RRV or a JMHV peptide, as well as immunologically effective portions ("fragments") thereof.

In one embodiment, the antibodies used in the present invention are monoclonal antibodies
(or immunologically effective portions thereof) and may also be humanized monoclonal antibodies
(or immunologically effective portions thereof). Immunologically effective portions of monoclonal
antibodies include Fab, Fab', F(ab')2, Fabc and Fv portions (for a review, see Better and Horowitz,
Methods Enzymol. 178:476-96, 1989). Anti-inhibitory peptide antibodies may also be produced
using standard procedures described in a number of texts, including Antibodies, A Laboratory
Manual by Harlow and Lane, Cold Spring Harbor Laboratory (1988).

Methods of making humanized monoclonal antibodies are well known, and include those
described in U.S. Patent Nos. 5,585,089; 5,565,332; 5,225,539; 5,693,761; 5,693,762; 5,585,089; and
5,530,101 and references cited therein. Similarly, methods of making and using immunologically
effective portions of monoclonal antibodies, also referred to as antibody fragments, are well known
and include those described in Better and Horowitz, 1989, Meth. Enzymol. 178:476-96; Better et al.,
1990; Better and Horowitz, 1990, Advances in Gene Technology: The Molecular Biology of Immune
Disease & the Immune Response (ICSU Short Reports); Glockshuber et al., 1990, Biochemistry
29:1362-7; and U.S. Patent Nos. 5,648,237; 4,946,778 and 5,455,030, and references cited therein.

The determination that a particular agent binds substantially only to an RRV peptide or a
JMHV peptide may readily be made by using or adapting routine procedures. One suitable in vitro
assay makes use of the Western blotting procedure (described in many standard texts, including
Antibodies, A Laboratory Manual by Harlow and Lane). Western blotting may be used to determine
that a given RRV peptide binding agent, such as an anti-IL-6 or MIP peptide monoclonal antibody,
binds substantially only to the specific RRV protein.

Specific hybridization: Specific hybridization refers to the binding, duplexing, or
hybridizing of a molecule only or substantially only to a particular nucleotide sequence when that
sequence is present in a complex mixture (e.g. total cellular DNA or RNA). Specific hybridization
may also occur under conditions of varying stringency.

Supernatant: The culture medium in which a cell is grown. The culture medium includes
material from the cell. If the cell is infected with a virus, the supernatant can include viral particles.

Subject: This term includes both human and non-human subjects. Similarly, the term
"patient" includes both human and veterinary subjects.

Transformed: A transformed cell is a cell into which has been introduced a nucleic acid
molecule by molecular biology techniques. As used herein, the term transformation encompasses all
techniques by which a nucleic acid molecule might be introduced into such a cell, including
transfection with viral vectors, transformation with plasmid vectors, and introduction of naked DNA
by electroporation, lipofection, and particle gun acceleration.

Transgenic Cell: Transformed cells which contain foreign, non-native DNA.

Variants of Amino Acid and Nucleic Acid Sequences: The production of RRV or JMHV
proteins can be accomplished in a variety of ways (for example see Examples 17, 21 and 25). DNA
sequences which encode the protein, or a fragment of the protein, can be engineered such that they
allow the protein to be expressed in eukaryotic cells, bacteria, insects, and/or plants. In order to
accomplish this expression, the DNA sequence can be altered and operably linked to other regulatory
sequences. The final product, which contains the regulatory sequences and the therapeutic protein, is
referred to as a vector. This vector can then be introduced into the eukaryotic cells, bacteria, insect,
and/or plant. Once inside the cell, the vector allows the protein to be produced.

One of ordinary skill in the art will appreciate that the DNA can be altered in numerous
ways without affecting the biological activity of the encoded protein. For example, PCR may be used
to produce variations in the DNA sequence which encodes RRV or JMHV proteins, such as IL-6 or
MIP. Such variants may be variants that are optimized for codon preference in a host cell that is to be
used to express the protein, or other sequence changes that facilitate expression.

Two types of cDNA sequence variant may be produced. In the first type, the variation in the
cDNA sequence is not manifested as a change in the amino acid sequence of the encoded
polypeptide. These silent variations are simply a reflection of the degeneracy of the genetic code. In
the second type, the cDNA sequence variation does result in a change in the amino acid sequence of
the encoded protein. In such cases, the variant cDNA sequence produces a variant polypeptide
sequence. In order to preserve the functional and immunologic identity of the encoded polypeptide, it
is preferred that any such amino acid substitutions are conservative. Conservative substitutions
replace one amino acid with another amino acid that has some homology in size, hydrophobicity, etc.
Such substitutions generally are conservative when it is desired to finely modulate the characteristics
of the protein. For example, conservative substitutions generally maintain (a) the structure of the
polypeptide backbone in the area of the substitution, for example, as a sheet or helical conformation,
(b) the charge or hydrophobicity of the molecule at the target site, or (c) the bulk of the side chain.

Examples of amino acids which may be substituted for an original amino acid in a protein
and which are regarded as conservative substitutions include: Ser for Ala; Lys for Arg; Gln or His for
Asn; Glu for Asp; Ser for Cys; Asn for Gln; Asp for Glu; Pro for Gly; Asn or Gln for His; Leu or Val
for Ile; Ile or Val for Leu; Arg or Gln for Lys; Leu or Ile for Met; Met, Leu or Tyr for Phe; Thr for
Ser; Ser for Thr; Tyr for Trp; Trp or Phe for Tyr; and Ile or Leu for Val.
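
For illustration, the substitutions listed above can be held in a lookup table keyed by the original residue. The sketch below uses one-letter amino acid codes; the names CONSERVATIVE and is_conservative are hypothetical, and the table is only a transcription of the list above (read with the standard three-letter residue names), not an exhaustive definition of conservative substitution.

```python
# Original residue -> residues listed above as conservative replacements.
CONSERVATIVE = {
    "A": {"S"},            # Ser for Ala
    "R": {"K"},            # Lys for Arg
    "N": {"Q", "H"},       # Gln or His for Asn
    "D": {"E"},            # Glu for Asp
    "C": {"S"},            # Ser for Cys
    "Q": {"N"},            # Asn for Gln
    "E": {"D"},            # Asp for Glu
    "G": {"P"},            # Pro for Gly
    "H": {"N", "Q"},       # Asn or Gln for His
    "I": {"L", "V"},       # Leu or Val for Ile
    "L": {"I", "V"},       # Ile or Val for Leu
    "K": {"R", "Q"},       # Arg or Gln for Lys
    "M": {"L", "I"},       # Leu or Ile for Met
    "F": {"M", "L", "Y"},  # Met, Leu or Tyr for Phe
    "S": {"T"},            # Thr for Ser
    "T": {"S"},            # Ser for Thr
    "W": {"Y"},            # Tyr for Trp
    "Y": {"W", "F"},       # Trp or Phe for Tyr
    "V": {"I", "L"},       # Ile or Leu for Val
}


def is_conservative(original, replacement):
    """True if replacing `original` with `replacement` appears in the list."""
    return replacement.upper() in CONSERVATIVE.get(original.upper(), set())
```

Because the list is directional, is_conservative("A", "S") is true (Ser for Ala) while is_conservative("S", "A") is false, since only Thr is listed as a replacement for Ser.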

The substitutions which in general are expected to produce the greatest changes in protein
properties will be non-conservative, for instance changes in which (a) a hydrophilic residue, e.g.,
seryl or threonyl, is substituted for (or by) a hydrophobic residue, e.g., leucyl, isoleucyl, phenylalanyl,
valyl or alanyl; (b) a cysteine or proline is substituted for (or by) any other residue; (c) a residue
having an electropositive side chain, e.g., lysyl, arginyl, or histidyl, is substituted for (or by) an
electronegative residue, e.g., glutamyl or aspartyl; or (d) a residue having a bulky side chain, e.g.,
phenylalanine, is substituted for (or by) one not having a side chain, e.g., glycine.

Variations in the DNA sequence that result in amino acid changes, whether conservative or
not, should be minimized in order to preserve the functional and immunologic identity of the encoded
protein. The immunologic identity of the protein may be assessed by determining whether it is
recognized by an antibody to an RRV (or JMHV) protein; a variant that is recognized by such an
antibody is immunologically conserved. Any DNA sequence variant will preferably introduce no
more than 20, and preferably fewer than 10 amino acid substitutions into the encoded polypeptide.
Variant amino acid sequences can, for example, be 80%, 90%, 95% or even 98% identical to the
native amino acid sequence.

Vector: A nucleic acid molecule as introduced into a host cell, thereby producing a
transformed host cell. A vector may include nucleic acid sequences that permit it to replicate in the
host cell, such as an origin of replication. A vector may also include one or more selectable marker
genes and other genetic elements known in the art.

Virion: A complete viral particle including envelope, capsid (if any), and nucleic acid
elements.

The present invention utilizes standard laboratory practices for the cloning, manipulation
and sequencing of nucleic acids, purification and analysis of proteins and other molecular biological
and biochemical techniques, unless otherwise stipulated. Such techniques are explained in detail in
standard laboratory manuals such as Sambrook et al. (Molecular Cloning: A Laboratory Manual, 2nd
ed., vol. 1-3, Cold Spring Harbor, New York, 1989) and Ausubel et al. (Current Protocols in
Molecular Biology, Greene Publ. Assoc. & Wiley-Intersciences, 1987).

EXAMPLE 1 
Isolation of RRV 

This example describes how RRV was isolated from a rhesus macaque monkey. Fresh,
dispersed bone marrow (BM) cells were isopycnic gradient-purified (Ficoll-Paque, Pharmacia) from a
2 yr, 202 day old captive-reared rhesus macaque that was euthanized 503 days after intravenous
infection with an SIVmac239 variant. Gradient-purified BM mononuclear cells were seeded into T-25
culture flasks and grown in the presence of Endothelial SFM media (GIBCO) supplemented with
10% fetal bovine serum, 1% L-glutamine, 1% penicillin-streptomycin-neomycin and 30 µg/mL
endothelial cell growth supplement.

Cultures developing cytopathic effects (CPE) were rapidly frozen in liquid N2 and thawed,
and supernatants clarified by centrifugation and filtered through a 0.45 µm membrane. Filtered extracts
were then used as inoculum on primary rhesus macaque fibroblast cultures. Fibroblast cultures
developing CPE were scraped free into medium, pelleted at 400 x g, washed in phosphate-buffered
saline and suspended in cold Ito and Karnovsky's fixative (2.5% glutaraldehyde, 0.5% picric acid,
1.6% paraformaldehyde, 0.005% ruthenium red) in 0.1 M sodium cacodylate buffer, pH 7.4 for 2
hours. Fixed cells were washed in cacodylate buffer, post-fixed in 1% OsO4 and 0.8% K3Fe(CN)6 in
cacodylate buffer for 1 hour, rinsed in distilled H2O and pre-stained in 2% aqueous uranyl acetate for
1 hour. Fixed and pre-stained cells were dehydrated in a graded series of acetone, embedded in Epon
812 epoxy resin, polymerized at 60°C and sectioned at 60 nm on an MT 5000 ultramicrotome.
Copper grid mounted sections were stained with lead citrate and uranyl acetate and viewed on a
Philips 300 electron microscope.

By electron microscopy, numerous herpesvirus particles were observed in the cells. This 
macaque developed LPD characterized as lymphocytic masses in myeloid and nonlymphoid tissues
which were confirmed histopathologically as solid pleomorphic lymphoid masses. 

EXAMPLE 2
Initial Characterization of RRV

Infectious virus was purified from infected primary rhesus fibroblast cultures exhibiting
100% CPE (see Example 1). Infected cells were harvested and disrupted by freeze-thawing and the
cell debris removed by low speed centrifugation. Supernatants were centrifuged in a Beckman JA-14
rotor for 1 hour at 9000 rpm to pellet the virus, which was further purified through a six-step sorbitol
gradient ranging from 20 to 70%, spun in a Beckman SW41 rotor for 2 hours at 18,000 rpm. Virus
was diluted in balanced buffered salts solution and then pelleted through a 20% sorbitol cushion.
Pelleted virus was resuspended in Tris-EDTA buffer (TE; 10 mM Tris-HCl, pH 8.0, and 1 mM
EDTA) and lysed in TE with 0.6% SDS and proteinase K (200 µg) at 37°C for 5 hours. Viral DNA
was then isolated by CsCl gradient centrifugation in a Beckman Ti75 rotor at 38,400 rpm for 72
hours, collected and dialyzed against TE.

The viral DNA was analyzed using degenerate primer polymerase chain reaction (PCR)
amplification and Southern blot hybridization with a probe specific for the KSHV thymidylate
synthase (TS) gene. The probe itself had the sequence of the KSHV TS gene (ORF 70). Four genes
were identified by these methods. A fragment encoding a portion of the viral DNA polymerase was
obtained and DNA sequence analysis revealed that the virus was most likely a gamma herpesvirus, as
amino acid sequence identity was highest among this class of herpesviruses. DNA sequence analysis
of the viral DNA fragment found to hybridize to the KSHV TS probe revealed three open reading
frames (ORFs) with homology to KSHV (Nicholas et al., 1997, Nature Med. 3:287-92; Russo et al.,
1996, Proc. Natl. Acad. Sci. USA 93:14862-7). One ORF encodes a homologue of macrophage
inflammatory protein MIP-1 with amino acid sequence identity with KSHV MIP-II, the second ORF
encodes a thymidylate synthase homologue and the third ORF encodes a homologue of interleukin-6
(IL-6) with homology to the rhesus IL-6 and KSHV IL-6. The presence of an IL-6-like cytokine and
an MIP-1-like CC chemokine flanking TS resembles the genomic organization of KSHV, indicating
that this virus is related to KSHV; it is referred to herein as rhesus rhadinovirus (RRV).

To determine if RRV is present in tissue containing the lymphocytic masses, oligonucleotide
PCR primers specific for the RRV MIP gene (see PCT Application No. US 99/26260) were designed
in an attempt to detect viral DNA in tissue from the macaque. By semi-quantitative PCR analysis,
viral DNA sequences were detected in DNA samples from bone marrow at approximately 590 copies
per 0.1 µg of tissue DNA. Because rhesus macaques held in captivity are reported to be naturally
infected with a herpesvirus similar to KSHV, bone marrow DNA samples were isolated from normal
and SIVmac239-infected macaques without LPD and analyzed by PCR. There was no evidence of
viral DNA sequences. Additionally, since simian Epstein-Barr virus (EBV) has been found to be
present in high copy number in lymphomas from SIV-infected macaques (Baskin et al., 1986, J. Natl.
Cancer Inst. 77:127-39; Feichtinger et al., 1990, Amer. J. Pathol. 137:1311-5), the tissue samples
from the macaque with disease were also analyzed by PCR for rhesus EBV (RhEBV) using
oligonucleotide primers for RhEBV latent membrane protein 1. By this analysis, no signal for
RhEBV was detected, suggesting that the RRV may be a contributing factor for LPD in this
SIV-infected macaque.

EXAMPLE 3
Preparation of RRV DNA for Cloning
Primary rhesus fibroblasts grown in two 850 cm2 roller bottles were infected with RRV at an
MOI of 0.1 and the virus was harvested from the culture supernatant and the infected monolayers 10
to 12 days post-infection. Cellular debris was removed from the culture supernatant by centrifugation
at 1,000 x g for 10 minutes. Intracellular virus particles were released by sonication followed by
centrifugation to pellet debris.

The two clarified supernatants were then combined and the virus was pelleted by
centrifugation at 12,500 x g for 1 hour at 4°C, and further purified through a six-step sorbitol gradient
ranging from 20 to 70%. Gradients were centrifuged in a Beckman SW41 rotor for 2 hours at 18,000
rpm at 4°C. The interface containing the virus was collected and diluted with cold buffered saline
solution. The virus was then pelleted by centrifugation in the SW41 for 50 minutes at 18,000 rpm.
The virus pellet was resuspended in 9.2 ml of TE (see Example 2) before the addition of 0.6 ml of
10% sodium dodecylsulfate (SDS) and 0.2 ml of proteinase K (10 mg/mL) to release the viral DNA.
Viral DNA was isolated by CsCl gradient centrifugation in a Beckman Ti75 rotor at 38,400 rpm for
72 hours, collected, and dialyzed against TE.

To ensure that the DNA isolated contained all the necessary sequences required for RRV 
replication, DNA was transfected, in duplicate, into primary rhesus fibroblasts by the calcium 
phosphate method without dimethyl sulfoxide shock and observed for cytopathic effects (CPE). 
Control transfections, lacking viral DNA or calcium phosphate, did not develop CPE.

EXAMPLE 4 
Construction of the Cosmid Library 
Approximately 100 µg of purified RRV DNA (Example 3) was partially digested with
Sau3A I. Aliquots taken at various time points were run on a 0.5% agarose gel and examined for the
fraction which gave the desired range of fragments (30 - 42 kb). The selected fraction was
dephosphorylated with calf intestinal alkaline phosphatase and 1 µg was ligated into the cosmid vector
SuperCos 1, prepared essentially as described by the manufacturer (Stratagene, La Jolla, CA). The
resulting ligation product was packaged using GigaPack II Gold packaging extract (Stratagene) and
grown for the isolation of recombinant cosmids.

Individual recombinant cosmids were grown in 3 ml cultures and the cosmid DNA was
isolated by alkaline lysis. Cosmid DNA was digested with EcoRI and the DNA fragments separated
on a 0.8% agarose gel. The separated fragments were transferred to nitrocellulose and probed with
various PCR amplification products corresponding to specific KSHV ORFs. Hybridization of the
probes to the transferred recombinant cosmids was done under conditions of moderate stringency (2x
SSC-0.1% SDS at 55°C) with each of the KSHV-specific probes and at high stringency (0.2x
SSC-0.1% SDS at 60°C) with the RRV-specific probes. By this analysis and restriction endonuclease
mapping, the recombinant cosmids were aligned and a set of recombinants was identified that
represented the entire viral genome when compared to digested viral DNA.

EXAMPLE 5
Cloning and Sequencing
Ten micrograms of each purified recombinant cosmid (Example 4) were digested with
EcoRI and the resulting fragments isolated from a 0.8% agarose gel using the QiaQuick gel extraction
protocol (Qiagen). Recovered fragments were ligated into pSP73 (Promega). Individual clones were
selected by restriction enzyme screening of DNA recovered by alkaline lysis from overnight cultures.
Sequencing templates were prepared by alkaline lysis, followed by precipitation with 6.5%
polyethylene glycol and 0.8 M NaCl. Templates were resuspended at a concentration of 0.1 µg/µl
and end sequences were determined using primers corresponding to the SP6 and T7 promoters of
pSP73. Internal sequences were determined using a combination of subcloning using convenient
restriction sites and custom primers. DNA sequencing reactions were performed with the Applied
Biosystems (ABI) PRISM Dye Terminator Cycle Sequencing Ready Reaction kits with AmpliTaq
DNA polymerase per the manufacturer's instructions. Sequence data was acquired using an ABI
373A Sequencer in the Molecular Biology Core at the Oregon Regional Primate Research Center.
The primary EcoRI fragments were sequentially arranged by sequencing across the EcoRI sites in the
intact cosmids using custom primers. Except for those regions containing long, high GC repeat units,
the entire viral DNA sequence was determined with a redundancy of 3- to 4-fold.

Sequences not accessible to custom primers or restriction subcloning were determined
following deletion subcloning using the Exo Size Deletion kit (New England Biolabs). To
accommodate this protocol, fragments were subcloned into vectors with restriction sites capable of
generating the needed 3' and 5' overhanging ends. Double restriction digests to generate 3' and 5'
overhanging ends were performed on 10 µg of recombinant plasmid DNA, which was then subjected
to exonuclease III digestion. Aliquots were removed from the exonuclease III digests at empirically
determined time points, frozen on dry ice, then, after all the time points had been collected, incubated
for 15 minutes at 65°C to inactivate the enzyme. The DNA was then treated with Mung bean
nuclease (MBN) for 30 minutes at 30°C. Prior to addition of 3 µl of MBN to the 12 µl exonuclease
III product, the enzyme was diluted 1/25 to reduce nonspecific digestion. Nuclease-treated DNA was
recovered using the Wizard prep system (Promega), then incubated for 30 minutes with 2.5 units of
T4 DNA polymerase (Life Technologies) and 1 µM dNTPs at 37°C. The final product was ligated
overnight with T4 DNA ligase and used to transform competent XL1-Blue bacteria. Deletion
products were size selected by restriction digests of DNA recovered from 3 ml cultures.

EXAMPLE 6
Assembly of the RRV Sequence, Assignment of ORFs, and Nomenclature

Factura (ABI) and Autoassembler (ABI) were used to assemble the final sequence from
individual sequencing runs. Open reading frames in the RRV sequence were determined with the
program MacVector (Oxford Molecular Group), using a setting of 100 or more amino acids. Putative
ORFs were then translated and compared to a database of KSHV ORFs. RRV ORFs which matched
KSHV ORFs were then compared to GenBank using BLASTP to verify the similarity, followed by a
Gap analysis (Wisconsin GCG analysis package; Oxford Molecular Group) to determine the levels of
similarity and identity between the RRV and KSHV proteins. When a gap in the genome of RRV
corresponded to the location of a KSHV ORF with less than 100 amino acids, MacVector was reset to
a lower limit. RRV ORFs were assigned the names of HVS ORFs when they showed similarity to
KSHV ORFs with the same name.
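The size-threshold screen described above can be sketched as follows. This is a minimal illustration, not the MacVector algorithm, whose internal ORF definition is not given here: it assumes an ORF runs from the first ATG in a reading frame to the next in-frame stop codon, scans all six frames, and reports only ORFs encoding at least `min_aa` amino acids. The function names and the toy sequence are hypothetical.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}
_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Reverse complement of an upper-case DNA string (A, C, G, T only)."""
    return seq.translate(_COMPLEMENT)[::-1]


def find_orfs(seq: str, min_aa: int = 100):
    """Return (strand, frame, start, end, n_aa) for ATG-to-stop ORFs.

    Coordinates are 0-based on the scanned strand; 'end' is the index just
    past the stop codon; n_aa counts codons from ATG up to, not including,
    the stop. Assumption: simple ATG..stop definition, standard stop codons.
    """
    seq = seq.upper()
    orfs = []
    for strand, s in (("+", seq), ("-", reverse_complement(seq))):
        for frame in range(3):
            start = None
            for i in range(frame, len(s) - 2, 3):
                codon = s[i:i + 3]
                if start is None:
                    if codon == "ATG":
                        start = i
                elif codon in STOP_CODONS:
                    n_aa = (i - start) // 3
                    if n_aa >= min_aa:
                        orfs.append((strand, frame, start, i + 3, n_aa))
                    start = None
    return orfs


# Toy sequence: one 6-codon ORF (ATG + 5 x GCT) followed by a TAA stop.
toy = "CC" + "ATG" + "GCT" * 5 + "TAA" + "G"
print(find_orfs(toy, min_aa=3))   # lowered limit finds the short ORF
print(find_orfs(toy))             # default 100-aa limit finds nothing
```

Lowering `min_aa` for a specific region mirrors the step above in which the size limit was reduced wherever a smaller KSHV ORF mapped to an unassigned stretch of the RRV genome.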

The nucleotide sequence data from this study have been deposited in the GenBank, EMBL, 
and DDBJ nucleotide sequence databases under accession number AF083501 (SEQ ID NO 1).



The genomic nucleotide sequence of the RRV genome (as shown in SEQ ID NO 1) was
determined using twenty-nine EcoRI fragments (as shown in FIG. 2) from seven overlapping isolates
of a partial Sau3A I cosmid library. Cosmids were selected by hybridization with PCR products from
KSHV ORFs. EcoRI fragments from each cosmid were subcloned into pSP73 (Promega) and
sequenced. The EcoRI fragments were arranged in the proper order by sequencing across the EcoRI
junctions in the parent cosmids using custom primers. Greater than 98% of the viral genome was
determined on both strands. The average sequencing redundancy was between 3 and 4, but three
regions were sequenced on only one strand. One of these regions is a 106 bp segment of ORF 61 that
was blocked on one side by an apparent hairpin. This segment was sequenced multiple times in one
direction using templates derived from independent overlapping cosmids. The other two regions are
1 kb, high G + C, repetitive sequences. These segments, which are discussed in more detail below,
were sequenced completely on one strand using a combination of custom primers and exonuclease III
deletions.

Terminal repeats were identified on both the left and right ends of the genome and the
sequence between them was designated as the long unique region (LUR). The first base to the right of the
left terminal repeat was designated base one. The LUR is 131,634 bp long (SEQ ID NO 1). The G +
C content of RRV is 52.2%, which is comparable to the 53.5% G + C content of KSHV, but
considerably higher than the 34.5% G + C content of the HVS genome. The CpG ratio is 1.11, which
is substantially higher than the ratio found for other gamma-herpesviruses.
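For illustration, the two composition statistics quoted above can be computed from a sequence as sketched below. The text does not state how the CpG ratio was calculated; the sketch assumes the commonly used observed/expected definition, (number of CpG dinucleotides x sequence length) / (number of C x number of G). The function names and the short test sequence are hypothetical.

```python
def gc_content(seq: str) -> float:
    """Percent G + C in a DNA sequence."""
    seq = seq.upper()
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * (seq.count("G") + seq.count("C")) / len(seq)


def cpg_ratio(seq: str) -> float:
    """Observed/expected CpG ratio (assumed definition, see lead-in).

    observed = count of 'CG' dinucleotides
    expected = (count of C * count of G) / sequence length
    """
    seq = seq.upper()
    c = seq.count("C")
    g = seq.count("G")
    if c == 0 or g == 0:
        raise ValueError("sequence lacks C or G; ratio undefined")
    return seq.count("CG") * len(seq) / (c * g)


# Hypothetical 8-base example: 2 C, 3 G, 2 CpG dinucleotides.
example = "ACGTCGGA"
print(gc_content(example))           # 62.5
print(round(cpg_ratio(example), 3))  # 2.667
```

A ratio near 1 indicates that CpG dinucleotides occur about as often as base composition alone would predict, while values well below 1 indicate CpG depletion.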

ORFs were identified by MacVector and compared to a database containing the full
complement of known KSHV ORFs. Matches between RRV and KSHV proteins were verified by a
BLASTP search of GenBank with the RRV proteins and then by Gap analysis. The initial screening
for ORFs used a minimum size limit of 100 amino acids. This limit was reduced when smaller
KSHV ORFs existed in locations corresponding to unassigned regions of RRV. Using this approach,
82 ORFs were identified, with 67 of these corresponding to ORFs found in both KSHV and HVS. In
accordance with the standard nomenclature for rhadinoviruses, these ORFs were labeled according to
the HVS designation. The 15 ORFs not found in HVS were assigned labels beginning with R (for
rhesus), indicating their presence in RRV, but not HVS. Some of these genes have counterparts in
KSHV.

A map of the genome of RRV is presented in FIG. 3, with all identified ORFs and their
orientations. The BamHI, EcoRI, and HindIII restriction sites in relation to the genome are shown in
FIG. 2. The BamHI and HindIII maps were generated from the final compiled sequence. The EcoRI
map was also generated from the final compiled sequence, but it was further characterized by
sequencing across the EcoRI junctions in the parent cosmids. Fragment sizes for each restriction map
are presented in FIG. 4.
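Generating a restriction map from a compiled sequence, as described above, amounts to locating each recognition site and measuring the distances between cut positions. The following is a minimal sketch under stated assumptions: the sequence is treated as linear, only the three enzymes named above are included, and each is taken to cut after the first base of its recognition site (G^AATTC, G^GATCC, A^AGCTT). The function names and the toy sequence are hypothetical and do not reproduce the fragment sizes of FIG. 4.

```python
# Recognition site and cut offset (bases from the start of the site).
SITES = {
    "BamHI": ("GGATCC", 1),
    "EcoRI": ("GAATTC", 1),
    "HindIII": ("AAGCTT", 1),
}


def cut_positions(seq: str, enzyme: str) -> list:
    """0-based positions at which the top strand is cut by the enzyme."""
    site, offset = SITES[enzyme]
    seq = seq.upper()
    positions = []
    i = seq.find(site)
    while i != -1:
        positions.append(i + offset)
        i = seq.find(site, i + 1)
    return positions


def fragment_sizes(seq: str, enzyme: str) -> list:
    """Fragment lengths, left to right, for a complete digest of a linear molecule."""
    bounds = [0] + cut_positions(seq, enzyme) + [len(seq)]
    return [b - a for a, b in zip(bounds, bounds[1:])]


# Toy 21-base sequence with one EcoRI site and one BamHI site.
toy = "AAAGAATTCTTTTGGATCCAA"
for name in SITES:
    print(name, fragment_sizes(toy, name))
```

An enzyme with no site in the sequence yields a single fragment equal to the full length, which is a quick sanity check when comparing predicted sizes against a gel of digested viral DNA.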

EXAMPLE 8 
Genomic Organization of RRV 

The overall genomic organization of RRV matches the general structure of gamma- 
herpesviruses, with blocks of shared ORFs interrupted at specific locations (referred to as divergent 
loci) where the viral genomes code for acquired cellular genes. The primate rhadinoviruses form a 
subset of the gamma-herpesviruses and their genomes are correspondingly more similar to each other
than to other members of the family. 

The genomic sequence of RRV is presented in SEQ ID NO 1. FIG. 3 shows a schematic 
representation of the ORFs of RRV with a corresponding restriction map. FIG. 4 shows the location, 
size and description of the RRV ORFs. 

EXAMPLE 9 
Comparison of RRV and KSHV ORFs 
A comparison of corresponding repeats in RRV and KSHV is shown in FIG. 5. In addition,
FIG. 5 presents data for RRV ORFs along with the results of the Gap analysis of ORFs shared by
RRV, KSHV, and HVS. All HVS-like ORFs found in KSHV are found in RRV. A comparison table
of interferon regulatory elements encoded by the RRV and KSHV genomes is shown in FIG. 6.

EXAMPLE 10 
Comparison of RRV and HVS ORFs 
FIG. 7 shows the results of the Gap analysis of ORFs shared by RRV, KSHV, and HVS. In 
general, RRV and HVS ORFs are highly similar when the corresponding RRV and KSHV ORFs are 
highly similar, although the Gap values are generally lower.

EXAMPLE 11
ORFs Unique to RRV and KSHV

RRV includes 14 genes which are not found in HVS (R1, R2, R3, R4, R5, R6, R7, R8, R9, R10,
R11, R12, R13, and R15) (see US 99/26260). These are designated in FIG. 3 as "R" ORFs. Of these
fifteen genes, 11 have counterparts in the genome of KSHV. R2 and R3 are cytokine genes. R2 has
functional homology to K2, the vIL-6 gene of KSHV. Gap analysis of the vIL-6 genes from KSHV
and RRV shows no notable similarity, but both possess four conserved cysteines found in cellular
IL-6. In addition, RRV vIL-6 has IL-6-like activity in cell culture. R3 has a small, but clear, similarity
to KSHV K4, a vMIP-1β gene. It is the only vMIP gene in RRV, as compared to the three vMIP
genes found in KSHV.

RRV R6 through R13 are vIRFs, as are KSHV K9 through K11 (FIG. 6). K9, the most
studied of the KSHV vIRFs, does not have a DNA binding domain, but has been demonstrated to
inhibit the endogenous cellular interferon response pathways. Five of the RRV vIRFs (R6, R7, R8,
R10, and R11) are similar to K9, though only R10 has a similarity greater than 30%. The remaining
similarities fall between 26% and 30%. There is no measurable similarity between any RRV vIRF
and any KSHV vIRF other than K9. There is, however, a pattern of higher similarity between
members of the RRV vIRF family. R6, R7, R8, and R9 are most similar to R10, R11, R12, and R13,
respectively, with the similarities falling between 50% and 62%. The pattern of similarity suggests a
single, possibly recent, gene duplication event for RRV which increased the number of vIRFs in the
genome from four to eight.

The final RRV gene with a unique KSHV counterpart is R15, which has some similarity to 
K14, a viral NCAM Ox-2 homologue. The similarity between R15 and K14 (35.2%) is relatively low
compared to most other shared proteins. 

A number of genes in RRV appear to be truly unique. R1 colocalizes with, but has no
similarity to, K1, a KSHV gene that has been demonstrated to have in vivo transforming ability. K1
and R1 both colocalize with ORF1, or STP (saimiri transforming protein), although both K1 and R1
are in opposite orientations compared to STP. A BLASTP search of GenBank using R1 reveals a
limited amino-terminal similarity to a series of Fc receptors, including a potential transmembrane
domain. These data suggest that R1, like K1 and STP, may have transforming potential via
transmembrane signaling.

R4 and R5 are located between ORF 50 and 52, the same location as K8 and K8.1 in KSHV;
however, there is no similarity between either R4 or R5 and the KSHV proteins. A BLASTP search
of GenBank failed to show any significant alignments with either R4 or R5, so their functions are
unknown.

RRV has no confirmed ORFs in the region corresponding to K12, the ubiquitously
expressed kaposin gene. A large ORF exists to the right of ORF 71, but it has no apparent control
regions (TATA box or polyadenylation signal), so it has not been designated as a true ORF, pending
identification of transcripts from this region. No ORFs corresponding to KSHV K15 have been
identified.

EXAMPLE 12
Co-localization of Repeat Units in RRV and KSHV

The RRV genome contains three highly repetitive regions, which correspond to three of the
repetitive regions of KSHV: frnk, zppa, and mdsk (FIG. 5). KSHV frnk and zppa, and the
corresponding RRV repetitive regions, rDL-B and rDL-E, respectively, are tandem repeats.

The first element of the RRV rDL-B repeat is much lower in G + C content than the
corresponding KSHV element, although the sizes are comparable (FIG. 5). The second element is
over 700 bp longer than the corresponding KSHV element. The first element of the RRV rDL-E
repeat is 30% longer than the corresponding KSHV element, and the second RRV element is over
four times as long as the second KSHV element. There is no sequence similarity between the various
elements of the two viruses, nor is there any similarity between any two repeat sequences in RRV.

Not all repeat elements found in KSHV have corresponding repeats in RRV. This includes
the KSHV vnct and waka/jwka repeats. This also includes the moi repeat, which is located in the
center of the KSHV ORF 73 and is responsible for the divergent lengths of RRV and KSHV ORF 73.
Moi is described in the annotations to the KSHV GenBank entry as having 15 different 11-16 bp
repeats. The result of this repeat element is the presence in ORF 73 of a highly acidic central domain,
with a large number of glutamate residues coded by a repeating GAG codon. KSHV ORF 73 is a
potential leucine zipper protein, with a number of leucine zipper sites in the repeat region. RRV
lacks the moi repeat and its concomitant acidic domain. It also lacks any evidence for a leucine
zipper, indicating that the biology of ORF 73 in RRV may be substantially different than the biology
of ORF 73 in KSHV.

EXAMPLE 13
Production of Simian Kaposi's Sarcoma (KS) and Lymphoproliferative Disorders Model

This example describes how the RRV cloned above can be used to produce a non-human
primate model for Kaposi's sarcoma and lymphoproliferative disorders. Four rhesus macaques
(identification numbers 18483, 18503, 18540 and 18570) that were approximately 1.5 years old, and
PCR- and seronegative for RRV were selected. To perform the antibody analysis, infected cells were
solubilized with 0.5% Nonidet P-40 and 1% sodium deoxycholate in phosphate buffered saline, and
clarified in a Beckman SW28 rotor at 23,500 rpm for 1 hour at 4°C. The clarified supernatant was
used as antigen for coating enzyme-linked immunosorbent assay (ELISA) plates (500 ng/well).
ELISAs were then performed essentially as described by Kodama et al. (AIDS Res Hum Retroviruses
5:337-43, 1989).

All of the animals were then inoculated intravenously with cell-free supernatants containing
the equivalent of 5 ng of p27 prepared from COS-1 cells transfected with an SIVmac239 molecular
clone (Endres et al., 1995, J. Med. Primatol. 24:141-4). The PBMCs from all macaques were
prescreened for in vitro susceptibility to virus infection as described by Naidu et al. (J. Virol.
62:4691-6, 1988). All inoculations and animal manipulations were performed according to
institutional guidelines at the Oregon Regional Primate Research Center (Beaverton, OR). Every 3-4
days for 4 weeks, then at 2-week intervals, macaques were sedated with ketamine hydrochloride (10
mg per kilogram of body weight) and examined for fever, weight loss, cutaneous signs,
lymphadenopathy, and hepatomegaly or splenomegaly. At these times, venipuncture was performed
and blood specimens collected. Plasma was monitored for virus during the first 4 weeks with the SIV
p27 enzyme-linked immunosorbent assay (ELISA) (Coulter Corp., Hialeah, FL). T cell subsets and
B cells were measured by flow cytometry with the OKT4 (CD4, Ortho), B9.11 (CD8, Coulter), and
B-Ly-1 (CD20, Coulter) monoclonal antibodies.

At 8 weeks post-SIV infection, rhesus macaques 18483 and 18570 were inoculated
intravenously with 5 x 10^6 plaque forming units of gradient purified RRV that was grown and titered
by plaque assay on primary rhesus fibroblasts. The two remaining macaques (18503 and 18540)
were kept as SIV-infected controls. Every 3-4 days for 2 weeks, once a week for 4 weeks, then at
2-week intervals, the macaques were examined and blood samples collected and analyzed. Virus
isolations were performed by cocultivation of 2 x 10^5 PBMCs from each of the macaques with
primary rhesus fibroblasts in duplicate. Cell cultures were monitored every 2-3 days for 3-4 weeks
for cytopathic effects characteristic of RRV. PBLs were also analyzed by PCR for the presence of
viral DNA. PCR analysis for RRV was performed with the following oligonucleotide primers:
vMIP-1 and vMIP-2 (see PCT US 99/26260). The conditions for PCR were 94°C for 2 minutes (1
cycle); 94°C for 0.5 minutes, 50°C for 0.5 minutes, 72°C for 0.5 minutes (30 cycles); 72°C extension for
5 minutes (1 cycle). Each PCR reaction used 0.1 µg of total DNA, 50 pmole of each primer, 1 U of
Vent polymerase, 40 µM each of deoxynucleotide triphosphate, 10 mM KCl, 10 mM Tris-HCl (pH
8.8), 10 mM (NH4)2SO4, 2 mM MgSO4 and 0.1% Triton X-100 in a final volume of 50 µL. The PCR
reactions were run out on a 1% agarose gel, transferred to nitrocellulose, and probed with a
32P-ATP-labeled oligonucleotide primer specific for vMIP-3 (see PCT US 99/26260). Hybridizations were
performed overnight at room temperature in 6X SSC, 0.1% SDS and 10 µg/mL E. coli tRNA.
Southern blots were then washed with 2X SSC and 0.1% SDS twice at room temperature followed by
two washes for 1 hour in 2X SSC and 0.1% SDS at 47°C. Bound probe was visualized by exposing
NEN duPont Reflection film to the washed membrane at -80°C with an NEN duPont Reflection screen.
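
The PCR cycling conditions and reaction composition given above can be written out as a small data structure, which makes the total programmed time and the working primer concentration easy to check. This is an illustrative sketch only: the variable and function names are hypothetical, ramp times between temperatures are ignored, and the values are simply those stated in this example.

```python
# Each stage: (label, [(temperature in C, hold time in minutes), ...], cycles)
PCR_PROGRAM = [
    ("initial denaturation", [(94, 2.0)], 1),
    ("amplification", [(94, 0.5), (50, 0.5), (72, 0.5)], 30),
    ("final extension", [(72, 5.0)], 1),
]


def total_hold_minutes(program) -> float:
    """Sum of programmed hold times; thermal ramping is not included."""
    return sum(
        cycles * sum(minutes for _, minutes in steps)
        for _, steps, cycles in program
    )


def concentration_uM(amount_pmol: float, volume_uL: float) -> float:
    """pmol per microliter is numerically equal to micromolar."""
    return amount_pmol / volume_uL


print(total_hold_minutes(PCR_PROGRAM))   # 52.0 minutes of programmed holds
print(concentration_uM(50, 50))          # 1.0 uM of each primer in 50 uL
```

The 50°C annealing step and 30 cycles are the parameters most often adjusted when such a program is adapted to different primers; the values here are those of this example, not a general recommendation.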
Infectious RRV was recovered from the peripheral blood mononuclear cells (PBMCs) of
both macaques injected with RRV as early as 4 weeks after inoculation for one macaque
(18570) and 8 weeks for the other macaque (18483), but not from the control macaques. The
peripheral blood leukocytes (PBL) from both macaques were also shown to harbor viral DNA as
determined by PCR and Southern blot analysis for the viral MIP gene, as early as 4 weeks after
inoculation for one macaque (18483) and as late as 14 weeks for the second macaque (18570).
Additionally, antibody responses to RRV were observed as measured by ELISA in the RRV-infected
macaques beginning 4 weeks post-infection, but not in the control macaques.

Flow cytometry analysis (FACS) of PBLs at the indicated weeks post-infection (FIGs. 8A-8D)
showed there was limited CD4+ lymphocyte depletion after SIV infection in both groups of
macaques followed by a rebound and sustained CD4+ lymphocyte counts. However, examination of
CD20+ B lymphocytes revealed significant differences between the two groups. The two control
macaques exhibited a dramatic and sustained decline in CD20+ B lymphocytes (FIGs. 8C and 8D),
whereas both co-infected macaques exhibited a transient increase in B lymphocytes beginning 6
weeks after RRV infection (FIGs. 8A and 8B). The increase in CD20+ B lymphocytes correlated
with the isolation and/or detection of RRV in both macaques; however, viral load did not appear to
correlate with the increase in CD20+ B lymphocytes when all samples from each macaque were
analyzed simultaneously. It has been reported that CD23, a B cell activation marker, is induced by
RhEBV infection of macaques (Moghaddam et al., 1997, Science 276:2020-33). FACS analysis of
PBMCs from RRV-infected macaques revealed no detectable CD23+ cells. This would suggest that
the mechanism responsible for increased numbers of CD20+ B lymphocytes following RRV infection
differs from the activation of B lymphocytes by RhEBV.

Routine physical examinations were performed on all four macaques, and early symptoms of
SIV infection were observed in all four macaques by 2 weeks, including fever, rash and malaise.
However, 11 weeks after inoculation with RRV, macaques 18483 and 18570 developed marked
lymphadenopathy and splenomegaly, with the spleen estimated to be enlarged to 10 to 20 times the
size of a normal spleen. In contrast, there was only slight lymph node enlargement in the control macaques not
infected with RRV and no detectable enlargement of the spleen. Lymph node biopsies of the
RRV-infected macaques revealed almost identical histology, characterized by a predominantly follicular
lesion with giant germinal centers and paracortical hyperplasia with increased vascularity, resembling
angiofollicular lymph node hyperplasia associated with KSHV in Castleman's disease (Lachant et al.,
1985, Am. J. Clin. Pathol. 83:27-33). In contrast, the lymph nodes of the control macaques exhibited
atrophied lymphoid follicles and paracortical depletion characteristic of SIV-induced lymphoid
atrophy (Chalifoux et al., 1987, Am. J. Pathol. 128:104-10; Ringler et al., 1989, Am. J. Pathol.
134:373-83; Wyand et al., 1989, Am. J. Pathol. 134:385-93). By FACS analysis, the majority of the
lymph node mononuclear cells were CD20+ B lymphocytes in RRV-infected macaques, whereas
CD4+ and CD8+ T lymphocytes predominated in the control macaques.

The presence of viral DNA was determined by PCR analysis on DNA derived from PBLs.
Detection of antibodies to RRV was determined by enzyme-linked immunosorbent assay (ELISA) on
plates coated with extracts derived from RRV-infected cells. By PCR analysis, RRV sequences were
more prevalent in the lymph nodes than in the peripheral blood of RRV-infected macaques, whereas
control macaques were negative for RRV sequences (FIGs. 9A and 9B).

Additional disease manifestations were also observed in the RRV-infected macaques that
parallel clinical features and B cell abnormalities observed in AIDS patients.
Hypergammaglobulinemia was observed in the RRV-infected macaque from which the virus was
derived, as well as in the macaques experimentally infected with RRV, whereas the two control
macaques had gammaglobulin levels similar to those before SIV infection. In addition, one of two
RRV-infected macaques (18570) developed severe autoimmune hemolytic anemia 30 weeks after
RRV infection, a condition frequently observed in MCD patients (Parravicini et al., 1997, Am. J.
Pathol. 151:1517-22).

The second of the two RRV-infected macaques developed other unique clinical
manifestations that paralleled those of AIDS patients with KS. At 60 weeks post-RRV infection it
developed a distended abdomen that was clinically evident upon physical examination. Palpation
revealed a pronounced fluid accumulation in the peritoneal cavity. This animal was euthanized due
to persistent fluid accumulation and hyperbilirubinemia. Necropsy analysis on this animal revealed
an abundance of ascites fluid, which was composed predominantly of CD20+ B cells, as identified by
FACS analysis. In addition, this animal exhibited a mesenchymal proliferative lesion throughout the
viscera that was identified by histopathological examination to be retroperitoneal fibromatosis (RF).
RF is an abnormal, highly vascularized mesenchymal proliferative lesion that exhibits histological
features resembling Kaposi's sarcoma. Analysis of DNA isolated from the ascites and RF lesion by
PCR with RRV MIP primers (given in Example 2) revealed a high viral load, implying RRV
infection was responsible for these abnormal proliferations.

EXAMPLE 14 

Other Methods to Prepare RRV Nucleic Acid Sequences

Obtaining the RRV Viral Genome 

The RRV genome (SEQ ID NO 1) can be procured by de novo isolation from a viral culture.
A biological sample of the virus (accession number VR-2601) may be obtained from the ATCC in
Manassas, VA. This virus can be grown in vitro using primary rhesus fibroblasts (see Example 1).
The virus is harvested from the culture supernatant and the infected host cells. Cellular debris is
removed by centrifugation, and intracellular virus particles may be released by sonication followed by
centrifugation to pellet debris. The virus is then pelleted by centrifugation and further purified
through a six-step sorbitol gradient. The interface containing the virus is collected, the virus is
pelleted by centrifugation, and the viral DNA is released by SDS disruption. Viral DNA may be
isolated by CsCl gradient centrifugation.

Obtaining Selected Polynucleotides from the Viral Genome

The isolated viral genome can be used as a source of polynucleotides identified by the
sequence disclosed herein (SEQ ID NO 1). The polymerase chain reaction (PCR) may be used to
amplify any polynucleotide selected from the known viral sequence using the viral genome as a
source of template DNA. The template DNA may also be provided in the form of one or more
cosmids that contain fragments of the viral genome. Alternatively, cDNA produced by reverse
transcription of RNA extracted from RRV-infected host cells may be used as a template in a
reverse-transcription PCR (RT-PCR) reaction. Methods and conditions for PCR and RT-PCR
amplification are described in Innis et al. (PCR Protocols: A Guide to Methods and Applications,
1990, Innis et al. (eds.), 21-27, Academic Press, Inc., San Diego, California).

The selection of PCR primers may be made according to the portions of the genome to be
amplified. Primers may be chosen to amplify small fragments of the genome, ORFs, or fragments
including many contiguous genes from the genome. Variations in amplification conditions may be
required to accommodate primers of differing lengths, and such considerations are well known in the
art and are discussed in Innis et al. (PCR Protocols: A Guide to Methods and Applications, 1990,
Innis et al. (eds.), 21-27, Academic Press, Inc., San Diego, California), Sambrook et al. (Molecular
Cloning: A Laboratory Manual, 2nd ed., vol. 1-3, Cold Spring Harbor, New York, 1989) and Ausubel
et al. (Current Protocols in Molecular Biology, Greene Publ. Assoc. & Wiley-Intersciences, 1987).
For example, the ORF corresponding to the MIP gene may be amplified from an RRV genomic (or
appropriate cosmid) template using a pair of primers (see PCT US 99/26260). Likewise, the ORF
corresponding to the IL-6 gene may be amplified from an RRV genomic (or appropriate cosmid)
template using a pair of primers (see PCT US 99/26260).

Many different primers may be selected from the sequence disclosed and used in PCR
amplification reactions to amplify DNA sequences of interest from the RRV genome.
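
By way of non-limiting illustration, the relationship between a chosen primer pair and the resulting amplification product can be sketched in Python. The function names and the toy template below are hypothetical and are not taken from SEQ ID NO 1 or from PCT US 99/26260. The sketch assumes exact primer matches and a linear, single-copy template, and ignores annealing temperature and mismatch tolerance.

```python
def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA string."""
    complement = {"A": "T", "C": "G", "G": "C", "T": "A", "N": "N"}
    return "".join(complement[base] for base in reversed(seq.upper()))


def predict_amplicon(template: str, forward: str, reverse: str):
    """Return the product bounded by an exact-match primer pair, or None.

    The forward primer must match the top strand. The reverse primer anneals
    to the top strand, so its reverse complement must appear in the top
    strand downstream of the forward primer.
    """
    template = template.upper()
    start = template.find(forward.upper())
    if start == -1:
        return None
    reverse_site = reverse_complement(reverse)
    end = template.find(reverse_site, start + len(forward))
    if end == -1:
        return None
    return template[start:end + len(reverse_site)]


# Toy template for illustration only; NOT part of the RRV genome.
toy_template = "GGATCCATGAAACCCGGGTTTAAACTGCAGTAAGAATTC"
amplicon = predict_amplicon(toy_template, "ATGAAACCC", "TTACTGCAG")
```

In practice the template would be the genomic or cosmid sequence, and primer design would also take melting temperature and secondary structure into account, as discussed in the references cited above.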

Polynucleotides that may be obtained by the above methods include, for example: the entire
polynucleotide genome of RRV as shown in SEQ ID NO 1; ORFs of this genome; oligonucleotides
comprising at least 15, 20, 30, 40, 50, 70, 100 or 150 consecutive nucleotides of the genome
sequence as shown in SEQ ID NO 1; nucleic acid sequences defined by nucleotides 1 to 11031 of
SEQ ID NO 1 and nucleotides 21625 to 131634 of SEQ ID NO 1; and ORFs selected from these
nucleic acid sequences. It is readily apparent that fragments of any length may be made using the
above methods and information.
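
As a minimal sketch of how such fragments may be specified, the following Python functions extract a region given by 1-based, inclusive nucleotide coordinates (the convention used for the ranges recited above) and enumerate every run of a given number of consecutive nucleotides. The function names and the placeholder sequence are hypothetical; a real use would load SEQ ID NO 1 in place of the toy string.

```python
def subsequence(genome: str, first: int, last: int) -> str:
    """Return nucleotides first..last using 1-based, inclusive coordinates."""
    if not 1 <= first <= last <= len(genome):
        raise ValueError("coordinates fall outside the sequence")
    # Python slices are 0-based and end-exclusive, hence first - 1.
    return genome[first - 1:last]


def fragments(genome: str, length: int):
    """Yield every run of `length` consecutive nucleotides, in order."""
    for i in range(len(genome) - length + 1):
        yield genome[i:i + length]


# Placeholder sequence for illustration only; NOT SEQ ID NO 1.
toy_genome = "ACGTACGTTAGC"
first_four = subsequence(toy_genome, 1, 4)
five_mers = list(fragments(toy_genome, 5))
```

For a sequence of N nucleotides there are N - k + 1 overlapping fragments of length k, which is why fragments of any length up to the full genome can be defined from the disclosed sequence.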

EXAMPLE 15 

Therapeutic and Diagnostic Uses of the RRV or JMHV IL-6 Protein 

As disclosed herein, the genome of RRV possesses an IL-6 gene (FIG. 1), and the genome
of JMHV possesses an IL-6 gene, similar to that found in KSHV. The IL-6 and MIP proteins of KSHV
are thought to be important in disease pathology, such as in Kaposi's sarcoma. The primary structure
of the RRV IL-6 protein is shown in FIG. 10 (SEQ ID NO 21). Given the sequence information, one
can readily make derivative proteins of RRV IL-6 or JMHV IL-6. In one specific, non-limiting
example, such derivative proteins include proteins that differ from the primary amino acid sequence
as shown in FIG. 10 (SEQ ID NO 21) by one or more conservative amino acid substitutions.
Examples of such conservative substitutions are given in the DEFINITIONS section of the
specification. Derivative proteins also include proteins consisting of an amino acid sequence that has
a defined degree of amino acid similarity with the RRV IL-6 or JMHV IL-6 protein. For instance,
such derivative proteins will typically have at least 50% sequence similarity (and may have at least
60%, 70%, 80%, 90%, 95%, 98% or even 99% sequence similarity) with the RRV IL-6 protein. Such
derivative proteins will not only share sequence similarity with KSHV IL-6 but will also possess IL-6
biological activity.
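
One simple way to score a candidate derivative against a reference sequence is sketched below in Python. The sketch assumes the two sequences have already been aligned to equal length with "-" marking gaps, and it counts a column as similar when the residues are identical or fall in the same substitution group. The groups shown are an illustrative assumption and are not the table given in the DEFINITIONS section; the function names and toy sequences are likewise hypothetical.

```python
# Illustrative conservative-substitution groups (an assumption for this
# sketch, not the table from the DEFINITIONS section).
CONSERVATIVE_GROUPS = [
    set("AGST"),  # small
    set("ILMV"),  # aliphatic
    set("FWY"),   # aromatic
    set("KRH"),   # basic
    set("DE"),    # acidic
    set("NQ"),    # amide
]


def is_conservative(a: str, b: str) -> bool:
    """True if residues a and b belong to the same substitution group."""
    return any(a in group and b in group for group in CONSERVATIVE_GROUPS)


def percent_similarity(aligned_a: str, aligned_b: str):
    """Return (percent identity, percent similarity) for two aligned sequences.

    Both values are computed over the full alignment length, so columns
    containing a gap count against the score.
    """
    if not aligned_a or len(aligned_a) != len(aligned_b):
        raise ValueError("sequences must be aligned to the same non-zero length")
    identical = similar = 0
    for a, b in zip(aligned_a.upper(), aligned_b.upper()):
        if "-" in (a, b):
            continue
        if a == b:
            identical += 1
            similar += 1
        elif is_conservative(a, b):
            similar += 1
    columns = len(aligned_a)
    return 100.0 * identical / columns, 100.0 * similar / columns


# Toy aligned peptides for illustration only.
identity, similarity = percent_similarity("MKVLDE-A", "MRVIDQTA")
```

Standard alignment programs use substitution matrices and gap penalties rather than fixed groups, so the percentage obtained depends on the scoring scheme chosen.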

IL-6 is a cytokine known to have pleiotropic immunological effects, including
anti-inflammatory and immunosuppressive effects, and may be used in several therapeutic and
diagnostic applications. RRV IL-6 or JMHV IL-6 of the invention may likewise be used. For instance,
IL-6 may be used to induce stimulation of hematopoietic stem cells, and to encourage proliferation,
differentiation and terminal maturation of erythroid cells from hematopoietic cells. Thus, for
instance, RRV IL-6 or JMHV IL-6 may be used in vivo or ex vivo to treat diseases that involve
leukopenia and thrombocytopenia. Such uses include stimulation of hematopoietic cells of
radiotherapy patients or people exposed to radiation accidentally. IL-6 may be used in such
applications in conjunction with GM-CSF (granulocyte-macrophage colony-stimulating factor) (see
U.S. Patent Nos. 5,610,056 and 5,620,685, herein incorporated by reference). IL-6 can also be used to
stimulate growth of megakaryocytes and platelets, and for the inhibition of tumor growth (see U.S.
Patent No. 5,620,685, herein incorporated by reference). IL-6 can also be used for the treatment of
leukemias, such as chronic myeloid leukemia (CML) and acute myeloid leukemia, by inducing
terminal differentiation of cells with IL-6 (see WO 90/01943, herein incorporated by reference).
RRV IL-6 or JMHV IL-6 may be used for all such applications.

Therapeutic applications may involve the administration of RRV IL-6 or JMHV IL-6 in a
number of ways. RRV IL-6 or JMHV IL-6 may be administered in vivo, e.g., by injection
systemically or locally, for instance, into a subject. Many other forms of in vivo administration are
possible, including intravenous, subcutaneous, across a mucous membrane (anally, vaginally or
sublingually), transdermal or by direct injection. Additionally, it may be administered ex vivo, by the
removal of cells from a subject, the treatment of these cells in vitro with RRV IL-6 or JMHV IL-6,
and the replacement of these cells into the subject. Another recently developed method of delivery of
a protein drug is by introducing the gene coding for the drug into a subject, for instance within the
genome of a virus, such as an adenovirus or a retrovirus, whereby the protein is expressed in the
subject. Other modes of administration are provided in Example 25.

Such examples are provided for illustrative purposes only and it will be seen that RRV IL-6
or JMHV IL-6 may be used in a variety of topical and systemic immunological treatments where it
would be desirable to stimulate cell proliferation or to induce anti-inflammatory or
immunosuppressive effects. Additionally, IL-6 of the invention may be used for research and
diagnostic purposes as discussed generally herein. For instance, IL-6 may be used to produce
antibodies for diagnostic purposes to diagnose diseases characterized by increased or decreased
production of IL-6, and the nucleic acid sequence encoding IL-6 may be used to produce probes and
primers for diagnostic and research purposes or for gene therapy applications. The IL-6 could also be
used as a targeting molecule for identifying cells with receptors for IL-6, and for directing therapeutic
agents to these cells, for example by linking detector or therapeutic molecules to IL-6.



EXAMPLE 16 

Therapeutic and Diagnostic Uses of the RRV or JMHV MIP Protein

The genome of RRV as disclosed herein possesses an MIP gene (FIGS. 1 and 11) similar to
that found in KSHV. The primary structure of the RRV MIP protein is shown in FIG. 11 (SEQ ID
NO 25). Similarly, JMHV encodes a MIP protein (see below). Given the sequence information, one
can readily make derivative proteins of RRV or JMHV MIP. Such derivative proteins include
proteins that differ from the primary amino acid sequence as shown in FIG. 11 (SEQ ID NO 25), or
from a JMHV MIP protein, by one or more conservative amino acid substitutions. Derivative
proteins also include proteins consisting of an amino acid sequence that has a defined degree of
amino acid similarity with the RRV or JMHV MIP protein. Typically such derivative proteins will
have at least 50% sequence similarity with the RRV or JMHV MIP protein, and may have at least
60%, 70%, 80%, 90%, 95%, 98%, or even 99% sequence similarity. Such derivative proteins will
not only share sequence similarity with KSHV MIP but will also possess MIP biological activity.
MIP biological activity can be detected and quantified using bioassays as described in Kledal et al.
(Science 277:1656-9, 1997) and Boshoff et al. (Science 278:290-4, 1997) that measure MIP
concentrations using HIV inhibition and calcium mobilization, respectively.

MIP is a cytokine that activates neutrophils to undergo an oxidative burst and is also
intrinsically pyrogenic. The MIP genes and proteins of the invention may be used in several
therapeutic and diagnostic ways. The RRV MIP protein may be used for the same applications as
other MIP proteins. Treatment of wounds to promote healing by application of MIP to the wound
site is discussed in U.S. Patent No. 5,145,676. U.S. Patent No. 5,474,983 (herein incorporated by
reference) discusses various methods of treatment of inflammatory diseases including asthma,
allergies and dermatitis. U.S. Patent No. 5,656,724 (herein incorporated by reference) discloses the
use of MIP to suppress proliferation of dividing myeloid cells, e.g., for the treatment of neutropenia.
Use of MIP to inhibit HIV is discussed by Kledal et al. (Science 277:1656-9, 1997). RRV or JMHV
MIP may be used for all such applications.

As illustrated for IL-6 above, MIP may be administered in various ways to provide a
therapeutic effect, including in vivo, ex vivo and by gene therapy.

Such examples are provided for illustrative purposes only and it will be seen that MIP may
be used in a variety of topical, systemic, in vivo and ex vivo immunological treatments where it would
be desirable to activate neutrophils or to induce fever. Additionally, MIP of the invention may be
used for diagnostic purposes as discussed generally herein. For instance, MIP may be used to
produce antibodies for diagnostic purposes to diagnose diseases characterized by increased or
decreased production of MIP, and the nucleic acid sequence encoding MIP may be used to produce
probes and primers for diagnostic and research purposes, or for gene therapy applications.

The MIP could also be used as a targeting molecule for identifying cells with receptors for
MIP, and for directing therapeutic agents to these cells, for example by linking detector or therapeutic
molecules to MIP.

Although Examples 15 and 16 provide examples of therapeutic uses of the RRV or JMHV
IL-6 and MIP proteins, any of the other proteins encoded by RRV or JMHV can also be
administered therapeutically or diagnostically. For example, RRV or JMHV proteins that induce
pathological or physiological conditions in a recipient can be administered to stimulate that condition
for study, or to provide an animal or human model of the condition. That model can then be used to
study the condition, or treatments for it.

EXAMPLE 17



Expression of RRV or JMHV cDNA Sequences

With the provision of the RRV genomic sequence (SEQ ID NO 1), and the provision of JMHV
sequences, the expression and purification of any of the RRV proteins, or JMHV proteins, by
standard laboratory techniques, is now enabled. Fragments amplified as described herein can be
cloned into standard cloning vectors and expressed in commonly used expression systems consisting
of a cloning vector and a cell system in which the vector is replicated and expressed. Purified
proteins may be used for functional analyses, antibody production, diagnosis, and patient therapy.
Furthermore, the DNA sequences of the RRV cDNAs or JMHV cDNAs can be manipulated in
studies to understand the expression of RRV genes or JMHV genes and the function of their products.
Mutant forms of RRV or JMHV may be isolated based upon information contained herein, and may
be studied in order to detect alteration in expression patterns in terms of relative quantities, and
functional properties of the encoded mutant RRV protein. Partial or full-length cDNA sequences,
which encode the protein, may be ligated into bacterial expression vectors. Methods for
expressing large amounts of protein from a cloned gene introduced into E. coli may be utilized for the
purification, localization and functional analysis of proteins. For example, fusion proteins consisting
of amino terminal peptides encoded by a portion of the E. coli lacZ or trpE gene linked to RRV
protein or JMHV protein may be used to prepare polyclonal and monoclonal antibodies against this
protein. Thereafter, these antibodies may be used to purify proteins by immunoaffinity
chromatography, in diagnostic assays to quantitate the levels of protein, and to localize proteins in
tissues and individual cells by immunofluorescence and microscopy.

Intact native protein may also be produced in E. coli in large amounts for functional studies.
Standard prokaryotic cloning vectors may also be used, for example pBR322, pUC18 or pUC19 as
described in Sambrook et al. (Molecular Cloning: A Laboratory Manual, 2nd ed., vol. 1-3, Cold
Spring Harbor, New York, 1989). Nucleic acids of RRV or JMHV may be cloned into such vectors,
which may then be transformed into bacteria such as E. coli, which may then be cultured so as to
express the protein of interest. Other prokaryotic expression systems include, for instance, the
arabinose-induced pBAD expression system that allows tightly controlled regulation of expression,
the IPTG-induced pRSET system that facilitates rapid purification of recombinant proteins and the
IPTG-induced pSE402 system that has been constructed for optimal translation of eukaryotic genes.
These three systems are available commercially from Invitrogen and, when used according to the
manufacturer's instructions, allow routine expression and purification of proteins.

Methods and plasmid vectors for producing fusion proteins and intact native proteins in
bacteria are described in Sambrook et al. (Molecular Cloning: A Laboratory Manual, Cold Spring
Harbor, New York, 1989, Chapter 17). Such fusion proteins may be made in large amounts, are easy
to purify, and can be used to elicit antibody response. Native proteins can be produced in bacteria by
placing a strong, regulated promoter and an efficient ribosome binding site upstream of the cloned
gene. If low levels of protein are produced, additional steps may be taken to increase protein
production; if high levels of protein are produced, purification is relatively easy. Suitable methods
are presented in Sambrook et al. (Molecular Cloning: A Laboratory Manual, Cold Spring Harbor,
New York, 1989) and are well known in the art. Often, proteins expressed at high levels are found in
insoluble inclusion bodies. Methods for extracting proteins from these aggregates are described by
Sambrook et al. (Molecular Cloning: A Laboratory Manual, Cold Spring Harbor, New York, 1989,
Chapter 17).

Vector systems suitable for the expression of lacZ fusion genes include the pUR series of
vectors (Ruther and Muller-Hill, 1983, EMBO J. 2:1791), pEX1-3 (Stanley and Luzio, 1984, EMBO
J. 3:1429) and pMR100 (Gray et al., 1982, Proc. Natl. Acad. Sci. USA 79:6598). Vectors suitable for
the production of intact native proteins include pKC30 (Shimatake and Rosenberg, 1981, Nature
292:128), pKK177-3 (Amann and Brosius, 1985, Gene 40:183) and pET-3 (Studier and Moffatt,
1986, J. Mol. Biol. 189:113). The RRV or JMHV fusion protein may be isolated from protein gels,
lyophilized, ground into a powder and used as an antigen. The DNA sequence can also be transferred
to other cloning vehicles, such as other plasmids, bacteriophages, cosmids, animal viruses and yeast
artificial chromosomes (YACs) (Burke et al., 1987, Science 236:806-12). These vectors may then be
introduced into a variety of hosts including somatic cells, and simple or complex organisms, such as
bacteria, fungi (Timberlake and Marshall, 1989, Science 244:1313-7), invertebrates, plants (Gasser
and Fraley, 1989, Science 244:1293), and mammals (Pursel et al., 1989, Science 244:1281-8), which
cells or organisms are rendered transgenic by the introduction of one or more heterologous RRV
DNAs and/or JMHV DNAs.

Various yeast strains and yeast-derived vectors are commonly used for expressing and
purifying proteins. For example, Pichia pastoris expression systems are available from Invitrogen
(Carlsbad, CA). Such systems include suitable Pichia pastoris strains, vectors, reagents,
transformants, sequencing primers and media.

Non-yeast eukaryotic vectors can also be used for expression of the RRV or JMHV proteins.
Examples of such systems are the well-known baculovirus system, the ecdysone-inducible
mammalian expression system that uses regulatory elements from Drosophila melanogaster to allow
control of gene expression, and the Sindbis viral expression system that allows high-level expression
in a variety of mammalian cell lines. These expression systems are available from Invitrogen.

For expression in mammalian cells, the cDNA sequence may be ligated to heterologous
promoters, such as the simian virus 40 (SV40) promoter in the pSV2 vector (Mulligan and Berg, 1981,
Proc. Natl. Acad. Sci. USA 78:2072-6), and introduced into cells, such as monkey COS-1 cells
(Gluzman, 1981, Cell 23:175-82), to achieve transient or long-term expression. The stable
integration of the chimeric gene construct may be maintained in mammalian cells by biochemical
selection, such as neomycin (Southern and Berg, 1982, J. Mol. Appl. Genet. 1:327-41) and
mycophenolic acid (Mulligan and Berg, 1981, Proc. Natl. Acad. Sci. USA 78:2072-6).

DNA sequences can be manipulated with standard procedures such as restriction enzyme
digestion, fill-in with DNA polymerase, deletion by exonuclease, extension by terminal
deoxynucleotide transferase, ligation of synthetic or cloned DNA sequences, and site-directed
sequence alteration via a single-stranded bacteriophage intermediate or with the use of specific
oligonucleotides in combination with PCR.

The cDNA sequence (or portions derived from it) or a minigene (a cDNA with an intron
and its own promoter) may be introduced into eukaryotic expression vectors by conventional
techniques. These vectors are designed to permit the transcription of the cDNA in eukaryotic cells by
providing regulatory sequences that initiate and enhance the transcription of the cDNA and ensure its
proper splicing and polyadenylation. Vectors containing the promoter and enhancer regions of
SV40 or the long terminal repeat (LTR) of the Rous sarcoma virus, and polyadenylation and splicing
signals from SV40, are readily available (Mulligan and Berg, 1981, Proc. Natl. Acad. Sci. USA
78:2072-6; Gorman et al., 1982, Proc. Natl. Acad. Sci. USA 78:6777-81). The level of expression of
the cDNA can be manipulated with this type of vector, either by using promoters that have different
activities (for example, the baculovirus pAC373 can express cDNAs at high levels in S. frugiperda
cells (Summers and Smith, 1985, Genetically Altered Viruses and the Environment, Fields et al.
(Eds.) 22:319-328, Cold Spring Harbor Laboratory Press, Cold Spring Harbor, New York)) or by
using vectors that contain promoters amenable to modulation, for example, the
glucocorticoid-responsive promoter from the mouse mammary tumor virus (Lee et al., 1982, Nature
294:228). The expression of the cDNA can be monitored in the recipient cells 24 to 72 hours after
introduction (transient expression).

In addition, some vectors contain selectable markers such as the gpt (Mulligan and Berg,
1981, Proc. Natl. Acad. Sci. USA 78:2072-6) or neo (Southern and Berg, 1982, J. Mol. Appl. Genet.
1:327-41) bacterial genes. These selectable markers permit selection of transfected cells that exhibit
stable, long-term expression of the vectors (and therefore the cDNA). The vectors can be maintained
in the cells as episomal, freely replicating entities by using regulatory elements of viruses such as
papilloma (Sarver et al., 1981, Mol. Cell Biol. 1:486) or Epstein-Barr (Sugden et al., 1985, Mol. Cell
Biol. 5:410). Alternatively, one can also produce cell lines that have integrated the vector into
genomic DNA. Both of these types of cell lines produce the gene product on a continuous basis.
One can also produce cell lines that have amplified the number of copies of the vector (and therefore
of the cDNA as well) to create cell lines that can produce high levels of the gene product (Alt et al.,
1978, J. Biol. Chem. 253:1357).

The transfer of DNA into eukaryotic cells, in particular human or other mammalian cells, is now
a conventional technique. The vectors are introduced into the recipient cells as pure DNA
(transfection) by, for example, precipitation with calcium phosphate (Graham and van der Eb, 1973,
Virology 52:466) or strontium phosphate (Brash et al., 1987, Mol. Cell Biol. 7:2013), electroporation
(Neumann et al., 1982, EMBO J. 1:841), lipofection (Felgner et al., 1987, Proc. Natl. Acad. Sci. USA
84:7413), DEAE dextran (McCutchan et al., 1968, J. Natl. Cancer Inst. 41:351), microinjection
(Mueller et al., 1978, Cell 15:579), protoplast fusion (Schaffner, 1980, Proc. Natl. Acad. Sci. USA
77:2163-7), or pellet guns (Klein et al., 1987, Nature 327:70). Alternatively, the cDNA can be
introduced by infection with virus vectors. Systems have been developed that use, for example,
retroviruses (Bernstein et al., 1985, Gen. Engrg. 7:235), adenoviruses (Ahmad et al., 1986, J. Virol.
57:267), or herpes virus (Spaete et al., 1982, Cell 30:295).

These eukaryotic expression systems can be used for studies of RRV or JMHV genes and
mutant forms of these genes, and the RRV or JMHV proteins and mutant forms of these proteins.
Such uses include, for example, the identification of regulatory elements located in the 5' region of
RRV genes or JMHV genes on genomic clones that can be isolated from genomic DNA libraries,
such as human or mouse libraries, using the information contained in the present invention. The
eukaryotic expression systems may also be used to study the function of the normal complete protein,
specific portions of the protein, or of naturally occurring or artificially produced mutant proteins.
Naturally occurring RRV or JMHV wild-type or mutant proteins may exist in a variety of cancers or
diseases, while artificially produced mutant proteins can be designed by site-directed mutagenesis as
described above. These latter studies may probe the function of any desired amino acid residue in the
protein by mutating the nucleotide coding for that amino acid.
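
The design step of such a mutagenesis study can be sketched in Python as follows. The sketch replaces the codon for a chosen residue in a coding sequence and translates the result with the standard genetic code. The function names and the toy coding sequence are hypothetical and are not drawn from any RRV or JMHV ORF, and the sketch models only the sequence change, not the laboratory procedure.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence; '*' marks a stop codon."""
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3  # ignore a trailing partial codon
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, usable, 3))


def mutate_codon(cds: str, residue_number: int, new_codon: str) -> str:
    """Replace the codon for the given 1-based residue with new_codon."""
    if len(new_codon) != 3:
        raise ValueError("a codon has exactly three nucleotides")
    start = (residue_number - 1) * 3
    if residue_number < 1 or start + 3 > len(cds):
        raise ValueError("residue number falls outside the coding sequence")
    return cds[:start] + new_codon.upper() + cds[start + 3:]


# Toy coding sequence for illustration only: encodes M-K-L-D-stop.
toy_cds = "ATGAAACTGGATTAA"
mutant_cds = mutate_codon(toy_cds, 3, "GCG")  # leucine 3 -> alanine
```

Comparing translate(toy_cds) with translate(mutant_cds) confirms that only the intended residue has changed before an oligonucleotide is ordered.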

Using the above techniques, the expression vectors containing RRV or JMHV genes or
cDNA sequence or fragments or variants or mutants thereof can be introduced into human cells,
mammalian cells from other species or non-mammalian cells as desired. The choice of cell is
determined by the purpose of the treatment. For example, monkey COS cells (Gluzman, 1981, Cell
23:175-82) that produce high levels of the SV40 T antigen and permit the replication of vectors
containing the SV40 origin of replication may be used. Similarly, Chinese hamster ovary (CHO),
mouse NIH 3T3 fibroblasts or human fibroblasts or lymphoblasts may be used.

One method that can be used to express RRV or JMHV polypeptides from the cloned RRV
or JMHV cDNA sequence in mammalian cells is to use the cloning vector pXT1. This vector is
commercially available from Stratagene and contains the long terminal repeats (LTRs) and a portion
of the GAG gene from Moloney murine leukemia virus. The position of the viral LTRs allows highly
efficient, stable transfection of the region within the LTRs. The vector also contains the herpes
simplex thymidine kinase promoter (TK), active in embryonal cells and in a wide variety of tissues
in mice, and a selectable neomycin gene conferring G418 resistance. Two unique restriction sites,
BglII and XhoI, are directly downstream from the TK promoter. RRV or JMHV cDNA, including the
entire open reading frame for an RRV protein or JMHV protein such as IL-6 and the 3' untranslated
region of the cDNA, is cloned into one of the two unique restriction sites downstream from the
promoter.
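
When choosing between the two cloning sites, it is useful to confirm that the cDNA insert does not itself contain the recognition sequence of the enzyme to be used. A minimal Python sketch of that check follows. The recognition sequences for BglII (AGATCT) and XhoI (CTCGAG) are the standard ones; the function names and the toy insert are hypothetical.

```python
# Recognition sequences of the two cloning-site enzymes. Both sites are
# palindromic, so scanning one strand is sufficient.
SITES = {"BglII": "AGATCT", "XhoI": "CTCGAG"}


def find_sites(seq: str, sites=SITES):
    """Map each enzyme name to the 0-based positions of its site in seq."""
    seq = seq.upper()
    hits = {}
    for name, site in sites.items():
        positions = []
        pos = seq.find(site)
        while pos != -1:
            positions.append(pos)
            pos = seq.find(site, pos + 1)
        hits[name] = positions
    return hits


def usable_enzymes(insert: str):
    """Return enzymes whose site is absent from the insert."""
    return [name for name, positions in find_sites(insert).items() if not positions]


# Toy insert for illustration only; it carries an internal XhoI site.
toy_insert = "ATGGCCCTCGAGTTTAAATGA"
internal_sites = find_sites(toy_insert)
choices = usable_enzymes(toy_insert)
```

An insert with an internal site for one enzyme would be cut during preparation, so the other site would ordinarily be preferred for that insert.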

The ligated product is transfected into mouse NIH 3T3 cells using Lipofectin (Life
Technologies, Inc.) under conditions outlined in the product specification. Positive transfectants are
selected after growing the transfected cells in 600 µg/ml G418 (Sigma, St. Louis, MO). The protein
is released into the supernatant and may be purified by standard immunoaffinity chromatography
techniques using antibodies raised against RRV proteins (see Example 18).

Expression of RRV or JMHV proteins in eukaryotic cells can be used as a source of proteins
to raise antibodies. The RRV or JMHV proteins may be extracted following release of the protein
into the supernatant as described above, or the cDNA sequence may be incorporated into a
eukaryotic expression vector and expressed as a chimeric protein with, for example, β-globin.
Antibody to β-globin is thereafter used to purify the chimeric protein. Corresponding protease
cleavage sites engineered between the β-globin gene and the cDNA are then used to separate the two
polypeptide fragments from one another after translation. One useful expression vector for
generating β-globin chimeric proteins is pSG5 (Stratagene). This vector encodes rabbit β-globin.

The present disclosure thus encompasses recombinant vectors which comprise all or part of
RRV or JMHV genome or cDNA sequences, for expression in a suitable host. The RRV or JMHV
DNA is operatively linked in the vector to an expression control sequence in the recombinant DNA
molecule so that an RRV or JMHV polypeptide can be expressed. The expression control sequence
may be selected from the group consisting of sequences that control the expression of genes of
prokaryotic or eukaryotic cells and their viruses and combinations thereof. The expression control
sequence may be specifically selected from the group consisting of the lac system, the trp system, the
tac system, the trc system, major operator and promoter regions of phage lambda, the control region
of fd coat protein, the early and late promoters of SV40, promoters derived from polyoma,
adenovirus, retrovirus, baculovirus and simian virus, the promoter for 3-phosphoglycerate kinase, the
promoters of yeast acid phosphatase, the promoter of the yeast alpha-mating factors and combinations
thereof.

The host cell, which may be transfected with the vector of this invention, may be selected
from the group consisting of: E. coli, Pseudomonas, Bacillus subtilis, Bacillus stearothermophilus or
other bacilli; other bacteria; yeast; fungi; plant; insect; mouse or other animal; or human tissue cells.

It is appreciated that for mutant or variant RRV or JMHV DNA sequences, similar systems
are employed to express and produce the mutant or variant product.

EXAMPLE 18 

Production of Antibodies to RRV and RRV Proteins or JMHV and JMHV Proteins

Polyclonal or monoclonal antibodies (including humanized monoclonal antibodies) and
fragments of monoclonal antibodies such as Fab, F(ab')2 and Fv fragments, as well as any other agent
capable of specifically binding to an RRV or JMHV protein, may be produced to the RRV virion, the
JMHV virion, or any of the RRV or JMHV proteins (for example odd-numbered SEQ ID NOs 3-165).
Optimally, antibodies raised against an RRV or JMHV protein would specifically detect the RRV or
JMHV protein of interest (or a virion containing the protein of interest). That is, such antibodies
would recognize and bind the protein and would not substantially recognize or bind to other proteins
found in human or other cells. The determination that an antibody specifically detects the RRV or
JMHV protein is made by any one of a number of standard immunoassay methods; for instance, the
Western blotting technique (Sambrook et al., 1989, Molecular Cloning: A Laboratory Manual, Cold
Spring Harbor Laboratory, Cold Spring Harbor, New York).

To determine that a given antibody preparation (such as one produced in a mouse)
specifically detects the RRV or JMHV protein by Western blotting, total cellular protein is extracted
from murine myeloma cells and electrophoresed on an SDS-polyacrylamide gel. The proteins are then
transferred to a membrane (for example, nitrocellulose) by Western blotting, and the antibody
preparation is incubated with the membrane. After washing the membrane to remove
non-specifically bound antibodies, the presence of specifically bound antibodies is detected by the use
of an anti-mouse antibody conjugated to an enzyme such as alkaline phosphatase; application of the
substrate 5-bromo-4-chloro-3-indolyl phosphate/nitro blue tetrazolium results in the production of a
dense blue compound by immuno-localized alkaline phosphatase. Antibodies which specifically
detect an RRV or JMHV protein will, by this technique, be shown to bind to the RRV or JMHV
protein band (which will be localized at a given position on the gel determined by its molecular
weight). Non-specific binding of the antibody to other proteins (such as serum albumin) may occur
and may be detectable as a weak signal on the Western blot. The non-specific nature of this binding
will be recognized by one skilled in the art by the weak signal obtained on the Western blot relative
to the strong primary signal arising from the specific binding of the antibody to the RRV or JMHV
protein.

A substantially pure virion can be obtained, or substantially pure RRV or JMHV protein
suitable for use as an immunogen is isolated by purification or recombinant expression.
Concentration of protein in the final preparation is adjusted, for example, by concentration on an
Amicon filter device, to the level of a few micrograms per milliliter. Monoclonal or polyclonal
antibody to the protein can then be prepared as described by Harlow and Lane (Antibodies, A
Laboratory Manual, Cold Spring Harbor Press, 1988).

Alternatively, antibodies may be raised against synthetic RRV or JMHV peptides
synthesized on a commercially available peptide synthesizer (see Example 26) based upon the
predicted amino acid sequence of the RRV or JMHV protein (Harlow and Lane, Antibodies, A
Laboratory Manual, Cold Spring Harbor Press, 1988).

Another method of raising antibodies against RRV or JMHV proteins is by subcutaneous
injection of a DNA vector which expresses the RRV protein into laboratory animals, such as mice.
Delivery of the recombinant vector into the animals may be achieved using a hand-held form of the
Biolistic system (Sanford et al., 1987, Particulate Sci. Technol. 5:27-37) as described by Tang et al.
(Nature 356:152-4, 1992). Expression vectors suitable for this purpose may include those which
express the RRV protein under the transcriptional control of either the human β-actin promoter or the
cytomegalovirus (CMV) promoter.

Monoclonal Antibody Production by Hybridoma Fusion 

Monoclonal antibody to epitopes of the RRV or JMHV protein identified and isolated as
described can be prepared from murine hybridomas according to the classical method of Kohler and
Milstein (Nature 256:495, 1975) or derivative methods thereof. Briefly, a mouse is repetitively
inoculated with a few micrograms of the selected protein over a period of a few weeks. The mouse is
then sacrificed, and the antibody-producing cells of the spleen isolated. The spleen cells are fused by
means of polyethylene glycol with mouse myeloma cells, and the excess unfused cells destroyed by
growth of the system on selective media comprising aminopterin (HAT media). The successfully
fused cells are diluted and aliquots of the dilution placed in wells of a microtiter plate where growth
of the culture is continued. Antibody-producing clones are identified by detection of antibody in the
supernatant fluid of the wells by immunoassay procedures, such as ELISA, as originally described by
Engvall (Enzymol. 70:419, 1980), and derivative methods thereof. Selected positive clones can be
expanded and their monoclonal antibody product harvested for use. Detailed procedures for
monoclonal antibody production are described in Harlow and Lane (Antibodies: A Laboratory
Manual, 1988, Cold Spring Harbor Laboratory, New York).

Polyclonal Antibody Production by Immunization 

Polyclonal antiserum containing antibodies to heterogeneous epitopes of a single protein can
be prepared by immunizing suitable animals with the expressed protein (e.g. see Example 17), which
can be unmodified or modified to enhance immunogenicity. Effective polyclonal antibody
production is affected by many factors related both to the antigen and the host species. For example,
small molecules tend to be less immunogenic than others and may require the use of carriers and
adjuvant. Also, host animals vary in response to site of inoculations and dose, with either inadequate
or excessive doses of antigen resulting in low titer antisera. Small doses (ng level) of antigen
administered at multiple intradermal sites appear to be most reliable. An effective immunization
protocol for rabbits can be found in Vaitukaitis et al. (J. Clin. Endocrinol. Metab. 33:988-91, 1971).
Booster injections can be given at regular intervals, and antiserum harvested when antibody
titer thereof, as determined semi-quantitatively, for example, by double immunodiffusion in agar
against known concentrations of the antigen, begins to fall. See, for example, Ouchterlony et al. (In:
Handbook of Experimental Immunology, Wier, D. (ed.), Chapter 19, Blackwell, 1973). Plateau
concentration of antibody is usually in the range of 0.1 to 0.2 mg/ml of serum (about 12 µM).
Affinity of the antisera for the antigen is determined by preparing competitive binding curves, as
described, for example, by Fisher (Manual of Clinical Immunology, Chapter 42, 1980).

Labeled Antibodies 

Antibodies of the present invention can be conjugated with various labels for their direct
detection (see Chapter 9, Harlow and Lane, Antibodies: A Laboratory Manual, 1988). The label,
which may include, but is not limited to, a radiolabel, enzyme, fluorescent probe, or biotin, is chosen
based on the method of detection available to the user.

EXAMPLE 19 
Diagnostic Methods 

An embodiment of the present invention is a method for screening a subject to determine if
the subject has been infected with RRV or JMHV. One major application of the RRV sequence
information presented herein is in the area of diagnostic testing for predisposition to a disease (such
as Kaposi's sarcoma and lymphoproliferative disorders for RRV, or multiple sclerosis for JMHV)
that develops in at least a sub-set of hosts infected with RRV or JMHV. The gene sequence of the
RRV genes, including intron-exon boundaries, is also useful in such diagnostic methods. The method
includes providing a biological sample obtained from the subject, which sample includes DNA or
RNA, and providing an assay for detecting in the biological sample the presence of any of the RRV
or JMHV genes or proteins. Suitable biological samples include samples obtained from body cells,
such as those present in peripheral blood, urine, saliva, tissue biopsy, surgical specimen, fine needle
aspirate specimen, amniocentesis samples and autopsy material. The detection in the biological
sample may be performed by a number of methodologies, as outlined below.

The foregoing assay may be assembled in the form of a diagnostic kit and preferably
comprises either hybridization with oligonucleotides; PCR amplification of the gene or a part
thereof using oligonucleotide primers; RT-PCR amplification of the RNA or a part thereof using
oligonucleotide primers; or direct sequencing of any of the RRV genes present in a subject using
oligonucleotide primers. The efficiency of these molecular genetic methods should permit the rapid
identification of patients infected with RRV.

One embodiment of such detection techniques is the polymerase chain reaction
amplification of reverse transcribed RNA (RT-PCR), using RNA isolated from cells (for example
lymphocytes), followed by direct DNA sequence determination of the products. The presence of one
or more RRV genes is taken as indicative of a potential RRV infection. The presence of one or more
JMHV genes is taken as indicative of a potential JMHV infection.

Alternatively, DNA extracted from lymphocytes or other cells may be used directly for
amplification. The direct amplification from genomic DNA would be appropriate for analysis of an
entire RRV or JMHV gene including regulatory sequences located upstream and downstream from
the open reading frame. Recent reviews of direct DNA diagnosis have been presented by Caskey
(Science 236:1223-1228, 1989) and by Landegren et al. (Science 242:229-37, 1989).

Further studies of RRV or JMHV genes isolated from subjects may reveal particular
mutations, deletions or alterations in gene sequences, which occur at a high frequency within
particular populations of individuals. In this case, rather than sequencing the entire RRV gene, it may
be possible to design DNA diagnostic methods to specifically detect the most common RRV
mutations, deletions or alterations in gene sequences.

The detection of specific DNA mutations or alterations in gene sequences may be achieved
by methods such as hybridization using specific oligonucleotides (Wallace et al., 1986, Cold Spring
Harbor Symp. Quant. Biol. 51:257-61), direct DNA sequencing (Church and Gilbert, 1984, Proc.
Natl. Acad. Sci. USA 81:1991-5), the use of restriction enzymes (Flavell et al., 1978, Cell 15:25;
Geever et al., 1981, Proc. Natl. Acad. Sci. USA 78:5081), discrimination on the basis of
electrophoretic mobility in gels with denaturing reagent (Myers and Maniatis, 1986, Cold Spring
Harbor Symp. Quant. Biol. 51:275-284), RNase protection (Myers et al., 1985, Science 230:1242),
chemical cleavage (Cotton et al., 1985, Proc. Natl. Acad. Sci. USA 85:4397-401), and the
ligase-mediated detection procedure (Landegren et al., 1988, Science 241:1077).

Oligonucleotides specific to normal, mutant or alternative sequences are chemically
synthesized using commercially available machines, labeled radioactively with isotopes (such as 32P)
or non-radioactively, with tags such as biotin (Ward and Langer et al., 1981, Proc. Natl. Acad. Sci.
USA 78:6633-57), and hybridized to individual DNA samples immobilized on membranes or other
solid supports by dot-blot or transfer from gels after electrophoresis. The presence of these specific
sequences is visualized by methods such as autoradiography or fluorometric (Landegren et al., 1989,
Science 242:229-37) or colorimetric reactions (Gebeyehu et al., 1987, Nucleic Acids Res. 15:4513-34).
The absence of hybridization would indicate a mutation in the particular region of the gene, or
that the patient is not infected with RRV or JMHV.

Sequence differences between disclosed and other forms of RRV or JMHV genes may also
be revealed by the direct DNA sequencing method of Church and Gilbert (Proc. Natl. Acad. Sci. USA
81:1991-5, 1988). Cloned DNA segments may be used as probes to detect specific DNA segments.
The sensitivity of this method is greatly enhanced when combined with PCR (Wrischnik et al., 1987,
Nucleic Acids Res. 15:529-42; Wong et al., 1987, Nature 330:384-6; Stoflet et al., 1988, Science
239:491-4). In this approach, a sequencing primer which lies within the amplified sequence is used
with double-stranded PCR product or single-stranded template generated by a modified PCR. The
sequence determination is performed by conventional procedures with radiolabeled nucleotides or by
automatic sequencing procedures with fluorescent tags.

Sequence alterations may occasionally generate fortuitous restriction enzyme recognition
sites or may eliminate existing restriction sites. Changes in restriction sites are revealed by the use of
appropriate enzyme digestion followed by conventional gel-blot hybridization (Southern, 1975, J.
Mol. Biol. 98:503). DNA fragments carrying the site (either normal, mutant, or alternative) are
detected by their reduction in size or increase of corresponding restriction fragment numbers.
Genomic DNA samples may also be amplified by PCR prior to treatment with the appropriate
restriction enzyme; fragments of different sizes are then visualized under UV light in the presence of
ethidium bromide after gel electrophoresis.
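
By way of illustration only, the effect of a sequence alteration on a restriction pattern can be sketched in a few lines of Python. The sequences below are invented for the example and are not taken from the sequence listing; EcoRI (GAATTC, cut after the first G) is used merely as a familiar enzyme, and the function name `fragment_lengths` is hypothetical. Only the top strand of a linear molecule is considered.

```python
def fragment_lengths(sequence, site="GAATTC", cut_offset=1):
    """Cut a linear DNA sequence at each occurrence of `site`.

    `cut_offset` is the number of bases from the start of the site to the
    cut position (1 for EcoRI, which cuts G^AATTC). Returns the fragment
    lengths in 5'->3' order.
    """
    sequence = sequence.upper()
    site = site.upper()
    cuts = []
    start = sequence.find(site)
    while start != -1:
        cuts.append(start + cut_offset)
        start = sequence.find(site, start + 1)
    bounds = [0] + cuts + [len(sequence)]
    return [end - begin for begin, end in zip(bounds, bounds[1:])]


# Hypothetical amplicons: the variant differs by one base inside the site.
normal = "ATGCCGTAGAATTCGGCTTAACG"
variant = "ATGCCGTAGAGTTCGGCTTAACG"

print(fragment_lengths(normal))   # two fragments: the site is present
print(fragment_lengths(variant))  # one fragment: the site has been lost
```

A change in the number or size of fragments between the two digests corresponds to the gain or loss of a site described above.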

Screening based on DNA sequence differences may be achieved by detection of alteration in
electrophoretic mobility of DNA fragments in gels with or without denaturing reagent. Small
sequence deletions and insertions can be visualized by high-resolution gel electrophoresis. For
example, a PCR product with small deletions is clearly distinguishable from a normal sequence on an
8% non-denaturing polyacrylamide gel (WO 91/10734; Nagamine et al., 1989, Am. J. Hum. Genet.
45:337-9). DNA fragments of different sequence compositions may be distinguished on denaturing
formamide gradient gels in which the mobilities of different DNA fragments are retarded in the gel at
different positions according to their specific "partial-melting" temperatures (Myers et al., 1985,
Science 230:1242). Alternatively, a method of detecting a mutation comprising a single base
substitution or other small change could be based on differential primer length in a PCR. For
example, an invariant primer could be used in addition to a primer specific for a mutation. The PCR
products of the normal and mutant genes can then be differentially detected in acrylamide gels.

In addition to conventional gel-electrophoresis and blot-hybridization methods, DNA
fragments may also be visualized by methods where the individual DNA samples are not
immobilized on membranes. The probe and target sequences may be both in solution, or the probe
sequence may be immobilized (Saiki et al., 1989, Proc. Natl. Acad. Sci. USA 86:6230-4). A variety of
detection methods, such as autoradiography involving radioisotopes, direct detection of radioactive
decay (in the presence or absence of scintillant), spectrophotometry involving colorigenic reactions
and fluorometry involving fluorogenic reactions, may be used to identify specific individual
genotypes.

If more than one mutation or alternative sequence is frequently encountered in one or more
RRV or JMHV genes, a system capable of detecting such multiple mutations would be desirable. For
example, a PCR with multiple, specific oligonucleotide primers and hybridization probes may be
used to identify all possible mutations or alternative sequences at the same time (Chamberlain et al.,
1988, Nucl. Acids Res. 16:1141-55). The procedure may involve immobilized sequence-specific
oligonucleotide probes (Saiki et al., 1989, Proc. Natl. Acad. Sci. USA 86:6230-4).

EXAMPLE 20 
Quantitation of RRV or JMHV Proteins

An alternative method of determining if a subject has been infected with RRV or JMHV is
to quantitate the level of one or more RRV (or JMHV) proteins in the cells of a subject. This
diagnostic tool would be useful for detecting the levels of RRV proteins which result from, for
example, infection by RRV. This diagnostic tool would also be useful for detecting the levels of the
JMHV proteins which result from infection by JMHV. These diagnostic methods, in addition to
those described in EXAMPLE 19, provide an enhanced ability to diagnose susceptibility to diseases
caused by RRV or JMHV infection.

The determination of RRV or JMHV protein levels would be an alternative or supplemental
approach to the direct determination of the presence of one or more RRV (or JMHV) genes by the
methods outlined above in EXAMPLE 19. The availability of antibodies specific to one or more of
the RRV (or JMHV) proteins (for example those described in Example 18) will facilitate the
quantitation of cellular RRV or JMHV proteins by one of a number of immunoassay methods which
are well known in the art and are presented in Harlow and Lane (Antibodies: A Laboratory Manual,
Cold Spring Harbor Laboratory, New York, 1988).

Such assays permit both the detection of RRV or JMHV proteins in a biological sample and
the quantitation of such proteins. Typical methods involve: providing a biological sample of the
subject in which the sample contains cellular proteins, and providing an immunoassay for
quantitating the level of RRV or JMHV protein in the biological sample. This can be achieved by
combining the biological sample with an RRV or JMHV specific binding agent, such as an antibody
(e.g. monoclonal or polyclonal antibodies that bind an RRV or JMHV protein), so that complexes
form between the binding agent and the viral protein present in the sample, and then detecting or
quantitating such complexes.

In particular forms, these assays may be performed with the RRV specific binding agent or
the JMHV specific binding agent immobilized on a support surface, such as in the wells of a
microtiter plate or on a column. The biological sample is then introduced onto the support surface
and allowed to interact with the specific binding agent so as to form complexes. Excess biological
sample is then removed by washing, and the complexes are detected with a reagent, such as a second
anti-RRV protein antibody, or anti-JMHV protein antibody, that is conjugated with a detectable
marker.

In an alternative assay, the cellular proteins are isolated and subjected to SDS-PAGE
followed by Western blotting, for example as described in Example 18. After resolving the proteins,
the proteins are transferred to a membrane, which is probed with specific binding agents that
recognize any of the RRV proteins or JMHV proteins. The proteins are detected, for example with
HRP-conjugated secondary antibodies, and quantitated.

In yet another assay, the level of one or more RRV or JMHV proteins in cells is analyzed
using microscopy. Using specific binding agents which recognize RRV or JMHV, samples can be
analyzed for the presence of one or more RRV or JMHV proteins. For example, frozen biopsied
tissue sections are thawed at room temperature and fixed with acetone at -20°C for 5 minutes.
Slides are washed twice in cold PBS for 5 minutes each, then air-dried. Sections are covered with
20-30 µl of antibody solution (15-45 µg/ml) (diluted in PBS, 2% BSA at 15-50 µg/ml) and incubated at
room temperature in a humidified chamber for 30 minutes. Slides are washed three times with cold
PBS for 5 minutes each and allowed to air-dry briefly (5 minutes) before applying 20-30 µl of the second
antibody solution (diluted in PBS, 2% BSA at 15-50 µg/ml), then incubated at room temperature in
a humidified chamber for 30 minutes. The label on the second antibody may contain a fluorescent
probe, enzyme, radiolabel, biotin, or other detectable marker. The slides are washed three times with
cold PBS for 5 minutes each, then quickly dipped in distilled water, air-dried, and mounted with PBS
containing 30% glycerol. Slides can be stored at 4°C prior to viewing.

For samples prepared for electron microscopy (versus light microscopy), the second
antibody is conjugated to gold particles. Tissue is fixed and embedded with epoxy plastics, then cut
into very thin sections (~1-2 µm). The specimen is then applied to a metal grid, which is then
incubated in the primary anti-RRV antibody, washed in a buffer containing BSA, then incubated in a
secondary antibody conjugated to gold particles (usually 5-20 nm). These gold particles are
visualized using electron microscopy methods.

For the purposes of quantitating the RRV or JMHV proteins, a biological sample of the
subject, which sample includes cellular proteins, is required. Such a biological sample may be
obtained from body cells, such as those in which expression of the protein has been detected.
Peripheral blood leukocytes expressing RRV or JMHV proteins are clearly the most
accessible and convenient source from which specimens can be obtained. Specimens can be obtained
from peripheral blood, urine, saliva, tissue biopsy, amniocentesis samples, surgical specimens, fine
needle aspirates, and autopsy material, particularly cancer cells. Quantitation of RRV or JMHV
proteins would be made by immunoassay and compared to levels of the protein found in non-RRV or
non-JMHV expressing cells, or to the level of RRV proteins in non-RRV infected cells (cells of the
same origin that are not infected), or to the level of JMHV proteins in non-JMHV infected cells (cells
of the same origin that are not infected). A significant (preferably 50% or greater) increase in the
amount of one or more RRV proteins in the cells of a subject compared to the amount of one or more
RRV proteins found in non-RRV infected cells or that found in normal cells, would be taken as an
indication that the subject may have been infected with RRV. Similarly, a significant (preferably
50% or greater) increase in the amount of one or more JMHV proteins in the cells of a subject
compared to the amount of one or more JMHV proteins found in non-JMHV infected cells or that
found in normal cells, would be taken as an indication that the subject may have been infected with
JMHV.
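
The comparison described above can be expressed as a simple relative-increase test. The following Python sketch is illustrative only: the function names are hypothetical, the input values are assumed to be immunoassay signals in arbitrary but identical units, and the control signal is assumed to be a positive background value measured in uninfected cells of the same origin.

```python
def relative_increase(sample_level, control_level):
    """Fractional increase of the sample over the control (0.5 means 50% higher)."""
    if control_level <= 0:
        raise ValueError("control level must be positive to compute a relative increase")
    return (sample_level - control_level) / control_level


def suggests_infection(sample_level, control_level, threshold=0.5):
    """True when the sample exceeds the control by at least `threshold`.

    The default of 0.5 mirrors the 'preferably 50% or greater' increase
    mentioned in the text; other thresholds are an assumption of the caller.
    """
    return relative_increase(sample_level, control_level) >= threshold


print(suggests_infection(1.5, 1.0))  # signal 50% above the control
print(suggests_infection(1.2, 1.0))  # signal 20% above the control
```

A positive result from such a comparison indicates only that the subject may have been infected; it does not replace the confirmatory methods of Example 19.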

EXAMPLE 21 

Sequence Variants of RRV or JMHV

Sequence information for JMHV is shown in Example 31, and in the attached sequence
listing. The amino acid sequence of JMHV proteins now facilitates the creation of DNA molecules,
and thereby proteins, which are derived from those disclosed but vary in their precise nucleic acid or
amino acid sequence from those disclosed. Such variants may be obtained through a combination of
standard molecular biology laboratory techniques and the nucleotide sequence information disclosed
by this invention.

Variant DNA molecules include those created by standard DNA mutagenesis techniques, for
example, M13 primer mutagenesis. Details of these techniques are provided in Sambrook et al. (In:
Molecular Cloning: A Laboratory Manual, Cold Spring Harbor, New York, 1989, Chapter 15). By
the use of such techniques, variants may be created which differ in minor ways from those disclosed.
DNA molecules and nucleotide sequences which are derivatives of those specifically disclosed herein
and which differ from those disclosed by the deletion, addition or substitution of nucleotides while
still encoding a protein which possesses the functional characteristics of the RRV proteins are
comprehended by this invention. Also within the scope of this invention are small DNA molecules
which are derived from the disclosed DNA molecules. Such small DNA molecules include
oligonucleotides suitable for use as hybridization probes or polymerase chain reaction (PCR) primers.
These small DNA molecules will comprise at least a segment of the RRV cDNA molecules or the
RRV gene and, for the purposes of PCR, will comprise at least a 15 or a 20-50 nucleotide sequence of
the RRV cDNAs (even-numbered SEQ ID Nos 2-164) or the RRV genes (i.e., at least 20-50
consecutive nucleotides of the RRV cDNA or gene sequences). Alternatively, these small DNA
molecules will comprise at least a segment of the JMHV cDNA molecules or the JMHV gene and,
for the purposes of PCR, will comprise at least a 15 or a 20-50 nucleotide sequence of the JMHV
cDNAs or the JMHV genes (i.e., at least 20-50 consecutive nucleotides of a nucleic acid encoding a
JMHV protein). DNA molecules and nucleotide sequences which are derived from the disclosed
DNA molecules as described above may also be defined as DNA sequences which hybridize under
stringent conditions to the DNA sequences disclosed, or fragments thereof.

Hybridization conditions resulting in particular degrees of stringency will vary depending
upon the nature of the hybridization method of choice and the composition and length of the
hybridizing DNA used. Generally, the temperature of hybridization and the ionic strength (especially
the Na+ concentration) of the hybridization buffer will determine the stringency of hybridization.
Calculations regarding hybridization conditions required for attaining particular degrees of stringency
are discussed by Sambrook et al. (In: Molecular Cloning: A Laboratory Manual, Cold Spring Harbor,
New York, 1989, ch. 9 and 11), herein incorporated by reference. By way of illustration only, a
hybridization experiment may be performed by hybridization of a DNA molecule (for example, a
deviation of the RRV cDNA) to a target DNA molecule (for example, the RRV cDNA) which has
been electrophoresed in an agarose gel and transferred to a nitrocellulose membrane by Southern
blotting (Southern, J. Mol. Biol. 98:503, 1975), a technique well known in the art and described in
Sambrook et al. (Molecular Cloning: A Laboratory Manual, Cold Spring Harbor, New York, 1989).
Hybridization with a target probe labeled with [32P]-dCTP is generally carried out in a solution of
high ionic strength such as 6xSSC at a temperature that is 20-25°C below the melting temperature,
Tm, described below. For such Southern hybridization experiments where the target DNA molecule
on the Southern blot contains 10 ng of DNA or more, hybridization is typically carried out for 6-8
hours using 1-2 ng/ml radiolabeled probe (of specific activity equal to 10^9 CPM/µg or greater).
Following hybridization, the nitrocellulose filter is washed to remove background hybridization. The
washing conditions should be as stringent as possible to remove background hybridization but to
retain a specific hybridization signal. The term Tm represents the temperature above which, under the
prevailing ionic conditions, the radiolabeled probe molecule will not hybridize to its target DNA
molecule. The Tm of such a hybrid molecule may be estimated from the following equation (Bolton
and McCarthy, Proc. Natl. Acad. Sci. USA 48:1390, 1962):

Tm = 81.5°C + 16.6(log10[Na+]) + 0.41(%G+C) - 0.63(% formamide) - (600/l)

where l is the length of the hybrid in base pairs.

This equation is valid for concentrations of Na+ in the range of 0.01 M to 0.4 M, and it is
less accurate for calculations of Tm in solutions of higher [Na+]. The equation is also primarily valid
for DNAs whose G+C content is in the range of 30% to 75%, and it applies to hybrids greater than
100 nucleotides in length (the behavior of oligonucleotide probes is described in detail in Ch. 11 of
Sambrook et al., Molecular Cloning: A Laboratory Manual, Cold Spring Harbor, New York, 1989).

Thus, by way of example, for a 150 base pair DNA probe derived from the open reading
frame of the RRV cDNA (with a hypothetical %GC = 45%), a calculation of hybridization conditions
required to give particular stringencies may be made as follows: For this example, it is assumed that
the filter will be washed in 0.3 x SSC solution following hybridization, thereby: [Na+] = 0.045 M;
%GC = 45%; formamide concentration = 0; l = 150 base pairs;
Tm = 81.5 + 16.6(log10[Na+]) + (0.41 x 45) - (600/150); and so Tm = 74.4°C.
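
The calculation above can be checked numerically. The Python sketch below is an illustration only; it uses the commonly cited form of the equation, in which the salt term is added (the logarithm of a sub-molar Na+ concentration is negative, so the term lowers Tm). The function name `melting_temperature` and its parameter names are hypothetical. With the inputs of the worked example this form gives roughly 73.6°C, within about one degree of the 74.4°C quoted in the text; the origin of the small difference is not stated in the text.

```python
import math


def melting_temperature(na_molar, gc_percent, length_bp, formamide_percent=0.0):
    """Estimate the Tm (in °C) of a DNA hybrid longer than about 100 bp.

    na_molar          -- Na+ concentration in mol/l (text: valid for 0.01-0.4 M)
    gc_percent        -- G+C content in percent (text: valid for 30-75%)
    length_bp         -- length of the hybrid in base pairs
    formamide_percent -- formamide concentration in percent
    """
    return (81.5
            + 16.6 * math.log10(na_molar)
            + 0.41 * gc_percent
            - 0.63 * formamide_percent
            - 600.0 / length_bp)


# Inputs of the worked example: 0.045 M Na+, 45% G+C, 150 bp, no formamide.
tm = melting_temperature(0.045, 45, 150)
print(f"Estimated Tm: {tm:.1f} °C")
```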

The Tm of double-stranded DNA decreases by 1-1.5°C with every 1% decrease in homology
(Bonner et al., J. Mol. Biol. 81:123, 1973). Therefore, for this given example, washing the filter in
0.3 x SSC at 59.4-64.4°C will produce a stringency of hybridization equivalent to 90%; that is, DNA
molecules with more than 10% sequence variation relative to the target RRV cDNA will not
hybridize. Alternatively, washing the hybridized filter in 0.3 x SSC at a temperature of 65.4-68.4°C
will yield a hybridization stringency of 94%; that is, DNA molecules with more than 6% sequence
variation relative to the target RRV cDNA molecule will not hybridize. The above example is given
entirely by way of theoretical illustration. One skilled in the art will appreciate that other
hybridization techniques may be utilized and that variations in experimental conditions will
necessitate alternative calculations for stringency.
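
The wash temperatures quoted above follow directly from the 1-1.5°C drop in Tm per 1% mismatch. A minimal Python sketch of that arithmetic is given below; the function name `wash_temperature_range` is hypothetical, and the Tm of 74.4°C is simply the value stated in the worked example.

```python
def wash_temperature_range(tm_celsius, max_mismatch_percent,
                           drop_low=1.0, drop_high=1.5):
    """Return (lowest, highest) wash temperature in °C for a given mismatch limit.

    Each 1% of mismatch is assumed to lower the Tm by between `drop_low`
    and `drop_high` °C, as stated in the text.
    """
    lowest = tm_celsius - drop_high * max_mismatch_percent
    highest = tm_celsius - drop_low * max_mismatch_percent
    return lowest, highest


for mismatch in (10, 6):
    low, high = wash_temperature_range(74.4, mismatch)
    print(f"up to {mismatch}% mismatch: wash at {low:.1f}-{high:.1f} °C")
```

For 10% and 6% permitted mismatch this reproduces the 59.4-64.4°C and 65.4-68.4°C ranges given in the text.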

In particular embodiments of the present invention, stringent conditions may be defined as
those under which DNA molecules with more than 25%, 15%, 10%, 6% or 2% sequence variation
(also termed "mismatch") will not hybridize.

The degeneracy of the genetic code further widens the scope of the present invention as it
enables major variations in the nucleotide sequence of a DNA molecule while maintaining the amino
acid sequence of the encoded protein. In one specific, non-limiting example, the eleventh amino acid
residue of the RRV MIP protein is alanine. This is encoded in the RRV cDNA by the nucleotide
codon triplet GCG. Because of the degeneracy of the genetic code, three other nucleotide codon
triplets, GCT, GCA and GCC, also code for alanine. Thus, the nucleotide sequence of the RRV DNA
could be changed at this position to any of these three codons without affecting the amino acid
composition of the encoded protein or the characteristics of the protein. Based upon the degeneracy
of the genetic code, variant DNA molecules may be derived from the DNA molecules disclosed
herein using standard DNA mutagenesis techniques as described above, or by synthesis of DNA
sequences. DNA sequences which do not hybridize under stringent conditions to the DNA sequences
disclosed by virtue of sequence variation based on the degeneracy of the genetic code are herein also
comprehended by this invention.
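
The alanine example above can be reproduced from the standard genetic code. The Python sketch below builds the standard codon table in the conventional T, C, A, G order and lists the codons synonymous with a given codon; the function name `synonymous_codons` is hypothetical, and only the standard nuclear code is assumed.

```python
from itertools import product

# Standard genetic code, codons enumerated with bases in the order T, C, A, G
# ('*' marks a stop codon).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): amino_acid
    for codon, amino_acid in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def synonymous_codons(codon):
    """Return the other codons that encode the same amino acid as `codon`."""
    codon = codon.upper()
    amino_acid = CODON_TABLE[codon]
    return sorted(c for c, aa in CODON_TABLE.items()
                  if aa == amino_acid and c != codon)


print(CODON_TABLE["GCG"])        # A (alanine)
print(synonymous_codons("GCG"))  # ['GCA', 'GCC', 'GCT']
```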

The invention also includes DNA sequences that are substantially identical to any of the
DNA sequences disclosed herein, where substantially identical means a sequence that has identical
nucleotides in at least 75%, 80%, 85%, 90%, 95%, 98%, or even 99% of the aligned sequences.
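
As a non-limiting illustration of how such a percentage might be computed, the Python sketch below scores two sequences that have already been aligned to equal length. The sequences are invented for the example, the function names are hypothetical, and the treatment of gaps (a column with a gap in one sequence counts as a difference) is an assumption, since the text does not specify an alignment method.

```python
def percent_identity(aligned_a, aligned_b, gap="-"):
    """Percent of aligned columns that carry identical residues.

    Both inputs must already be aligned to the same length. Columns that are
    gaps in both sequences are ignored; a gap opposite a residue is a difference.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have the same length")
    columns = [(a, b) for a, b in zip(aligned_a.upper(), aligned_b.upper())
               if not (a == gap and b == gap)]
    if not columns:
        raise ValueError("no aligned columns to compare")
    matches = sum(1 for a, b in columns if a == b and a != gap)
    return 100.0 * matches / len(columns)


THRESHOLDS = (75, 80, 85, 90, 95, 98, 99)  # the levels named in the text


def thresholds_met(identity):
    """Return the identity levels from the text that `identity` satisfies."""
    return [t for t in THRESHOLDS if identity >= t]


reference = "ATGGCGTTAGCCAAGTTCGA"  # hypothetical 20-nucleotide segment
candidate = "ATGGCTTTAGCCAAGTTTGA"  # same segment with two substitutions

identity = percent_identity(reference, candidate)
print(identity, thresholds_met(identity))
```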

One skilled in the art will recognize that the DNA mutagenesis techniques described above
may be used not only to produce variant DNA molecules, but will also facilitate the production of
proteins which differ in certain structural aspects from the RRV or JMHV proteins, yet which
proteins are clearly derivative of this protein and which maintain the essential characteristics of the
RRV or JMHV proteins. Newly derived proteins may also be selected in order to obtain variations on
the characteristic of the RRV or JMHV proteins, as described above. Such derivatives include those
with variations in amino acid sequence including minor deletions, additions and substitutions.

While the site for introducing an amino acid sequence variation is predetermined, the
mutation per se need not be predetermined. For example, in order to optimize the performance of a
mutation at a given site, random mutagenesis may be conducted at the target codon or region and the
expressed protein variants screened for the optimal combination of desired activity. Techniques for
making substitution mutations at predetermined sites in DNA having a known sequence as described
above are well known.

Amino acid substitutions are typically of single residues; insertions usually will be on the
order of about from 1 to 10 amino acid residues; and deletions will range about from 1 to 30 residues.
Deletions or insertions preferably are made in adjacent pairs, i.e., a deletion of two residues or
insertion of two residues. Substitutions, deletions, insertions or any combination thereof may be
combined to arrive at a final construct. Obviously, the mutations that are made in the DNA encoding
the protein must not place the sequence out of reading frame and preferably will not create
complementary regions that could produce secondary mRNA structure.
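
The reading-frame requirement stated above amounts to keeping the net change in coding-sequence length a multiple of three nucleotides. A minimal Python sketch of that check follows; the function name and the short coding sequences are hypothetical, and the check deliberately ignores other considerations such as introduced stop codons or mRNA secondary structure.

```python
def keeps_reading_frame(original_cds, mutated_cds):
    """True when the net length change between the two coding sequences
    is a whole number of codons (a multiple of three nucleotides)."""
    return (len(mutated_cds) - len(original_cds)) % 3 == 0


original = "ATGGCGTTAGCCAAG"    # hypothetical coding segment (5 codons)
two_codons_deleted = "ATGGCCAAG"  # GCGTTA removed: two residues deleted
two_bases_deleted = "ATGCGTTAGCCAAG"[:13]  # 13 nt: net loss of two nucleotides

print(keeps_reading_frame(original, two_codons_deleted))  # in frame
print(keeps_reading_frame(original, two_bases_deleted))   # frameshift
```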

Substitutional variants are those in which at least one residue in the amino acid sequence has
been removed and a different residue inserted in its place. Such substitutions generally are made
conservatively, as defined above.

The effects of these amino acid substitutions or deletions or additions may be assessed for
derivatives of the JMHV proteins by assays in which DNA molecules encoding the derivative
proteins are transfected into cells using routine procedures. These JMHV proteins are expressed
recombinantly (for example see Example 17), purified, and analyzed for their ability to cause
symptoms associated with JMHV infection.

EXAMPLE 22 
Cloning Virus in Other Species 
The genomic sequence of the rhesus macaque RRV (SEQ ID NO 1) and sequences of
JMHV facilitate the identification of DNA molecules, and thereby proteins, which are the RRV or
JMHV homologs in other species. These other homologs can be derived from those sequences
disclosed, but vary in their precise nucleotide or amino acid sequence from those disclosed.
Such variants may be obtained through a combination of standard molecular biology laboratory
techniques and the nucleotide and amino acid sequence information disclosed by this invention.

The Japanese macaque virus isolate was isolated from a lesion that was minced and co-cultured
with primary rhesus fibroblasts. The isolate was then cloned by limiting dilution and a stock
of virus generated from this clone. Total cellular DNA was harvested from virus infected cells and
the DNA subjected to degenerate PCR for viral DNA polymerase, exactly as described above for
RRV. Once confirmed, a cosmid library of this virus was made from purified viral DNA (as
described for RRV) and then a portion of the protein genes was cloned and sequenced.
Results for this analysis are shown in the following Table 1:

TABLE 1 

RRV Sequences from Japanese Macaque

Total number of amino acid residues inferred: 972
Number of differences compared to RRV: 29 
Percent identity: 97.02% 
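
The percent identity in Table 1 follows directly from the two counts reported there. As a minimal worked sketch (the function name is hypothetical and chosen only for illustration), the figure can be reproduced in Python as follows:

```python
def percent_identity(total_residues: int, differences: int) -> float:
    """Percent of compared residues that are identical."""
    if total_residues <= 0:
        raise ValueError("total_residues must be positive")
    if not 0 <= differences <= total_residues:
        raise ValueError("differences must lie between 0 and total_residues")
    return 100.0 * (total_residues - differences) / total_residues


# Values taken from Table 1: 972 residues inferred, 29 differences from RRV.
table1_identity = percent_identity(total_residues=972, differences=29)
print(f"{table1_identity:.2f}%")  # 943 / 972 of residues identical
```

This treats every compared position equally and assumes the 972 residues were aligned without gaps, which the table does not state explicitly.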

Japanese Macaque Data 

These are end sequences. For ORFs represented twice, section I is from one plasmid and section II is
from another plasmid. These are non-overlapping sections.

Orf 7 section I (see SEQ ID NO: 14)
GIJWSIDDTmAI^RDC^rvaTTQ
AWAATAASDVIKDAQYRKEQY^
YGEASELVNHFUJUtftf^AIJ^
TGPLTRQSDHTGPArWVLAQCFE^

Orf 7 section II (see SEQ ID NO: 14)
PIESLFCGGIINSroDTINAI^^
ATPSEPRTATVXATAASDVIKDAQYRXEQYMKKVAia)XF^^
RMGGRRI

Orf 8 (see SEQ ID NO: 24 and SEQ ID NO: 27)
YRKVATSVTVYRGWTETAVTC
DFmQTVHXJPVEGLTDMQRYFSQPVLYTI^
VTAIXjDTVEVSPFCXNDSTCSVADKAENGmVRV^
VSWKAEDPKSAVCALTLWKTFPRAIQTTHESQLPLCGQRR

Orf 9 section I (see SEQ ID NO: 20 and SEQ ID NO: 25)
WSRFQTDIIPSGTVHGJLGRTT^
GRAACGFSTRRVTKEIIX1TDVAEHPVTE
FVIJDHGITTFGWYSCARATPRL^^

Orf 9 section II (see SEQ ID NO: 20 and SEQ ID NO: 25)
MDFFNPYIXiPRGPRPPSHKCTOAPAPAGAVQPPPDVCRI^
ENGARGDVIXAHERSMWTA

Orf 10 (see SEQ ID NO: 18)
AQMKDYAPGDPNAEI\nLGQSGPVLPTHTGGRVLG\nf
ARGDIAFLVAGVAPEPIJFIVTPT^^

Orf 29b (SEQ ID NO: 28)
NVAVEGNSSQDAGVAIATVLNEICSWLSFIJIHADKNT^
SGTPSASQTWSHTIKI^FDPVAYIJDQIKAm
HYMATNDKFVFKSLE

The clones containing the Japanese macaque gamma2 herpesvirus were subsequently sequenced. The
nucleic acid sequence from the Japanese macaque gamma2 herpesvirus was then compared with the
sequence of the rhesus macaque gamma2 herpesvirus 17577. Based on a comparison of 2328 amino
acid residues from fourteen independent regions (average length 166 ± 71), the percent similarity
between the two viruses was calculated to be an average of 94.12% at the polypeptide level (range
75.652% to 99.422%) when JMHV and RRV sequences were compared (see Example 22).

Similarly, this invention now also facilitates the identification of other DNA molecules, and
thereby proteins, which are the JMHV homologs in other species. These other homologs are derived
from or related to those sequences disclosed, but vary in their precise nucleotide or amino acid
sequence. Such variants may be obtained through a combination of standard molecular biology
laboratory techniques and the JMHV nucleotide and amino acid sequence information disclosed by this
invention.

In one specific, non-limiting example, a human homolog of JMHV is isolated from a human
MS lesion that is minced and co-cultured with primary fibroblasts, using the methods
described herein. In one specific, non-limiting example, primary tissue explant cultures and
cocultivation of leukocytes with target cells are used to enhance the isolation of a human virus. Fresh
tissue (e.g., spinal cord) is dissected from a visible human MS lesion identified during necropsy. Tissue
explant cultures are established from small tissue fragments cultured in flasks (e.g., T-25 flasks) in
media, such as Dulbecco's Modified Eagle Medium (DMEM) supplemented with serum, L-glutamine
and antibiotics. In one example, 20% fetal calf serum is utilized. In another example, human serum
is utilized. The explant cultures are fed by partial medium replacement, and passaged at confluence.
Confluent monolayer explant cultures are rinsed in calcium- and magnesium-free phosphate-buffered
saline, pH 7.4, and digested with trypsin-EDTA at 37°C, washed with growth medium and seeded
into flasks. These first passage explant cultures are fed by complete medium replacement, and
watched for the development of cytopathic effects. Once cytopathic effects are noted, the cells are
rapidly frozen in liquid nitrogen and thawed. Cell-free supernatants are clarified by centrifugation,
filtered, and frozen at -80°C.

Peripheral blood mononuclear cells (PBMC) are then isopycnic gradient-purified (Ficoll-Paque,
Pharmacia) from fresh ammonium heparin anti-coagulated blood obtained from a human MS
patient at necropsy. Gradient-purified PBMC are washed in calcium- and magnesium-free
phosphate-buffered saline and approximately 3x10^6 PBMC are cocultured with 1x10^5 primary human
fibroblasts in T-25 culture flasks containing DMEM growth medium supplemented with serum, 1%
L-glutamine and antibiotics (e.g., 1% penicillin-streptomycin). The PBMC-fibroblast
cocultures are fed by partial medium replacement after several days in culture, and passaged at
confluence at approximately day 9. Confluent PBMC-fibroblast cocultures are rinsed in calcium- and
magnesium-free phosphate-buffered saline, pH 7.4, digested with trypsin-EDTA at 37°C, washed
with growth medium and then seeded into culture flasks. First passage PBMC-fibroblast cultures are
fed by complete medium replacement at 3 to 5 day intervals and exhibit cytopathic effects (CPE) one
to two weeks after passage. Once cytopathic effects are noted, the cultures are rapidly frozen in
liquid N2 and thawed. Cell-free supernatants are clarified by centrifugation, filtered through a
0.45 micron membrane and stored frozen at -80°C.

To prepare virus stocks, cell-free filtered supernatants are harvested from the explant
cultures and the PBMC-fibroblast cocultures and separately inoculated onto primary human fibroblast
cultures. Fibroblast cultures inoculated with explant supernatant and PBMC-fibroblast coculture
supernatant develop CPE after inoculation. The cultures are rapidly frozen in liquid N2 and thawed,
and cell-free supernatants are clarified by centrifugation, filtered through a 0.45 micron membrane
and stored frozen at -80°C.

The isolate is then cloned by limiting dilution, and a stock of virus is generated from this
clone. Total cellular DNA is harvested from virus infected cells. The harvested DNA is then
subjected to degenerate PCR for viral DNA polymerase, exactly as described above for JMHV and
RRV. Once confirmed, a cosmid library of this virus is made from purified viral DNA (as described
above for JMHV and RRV). The viral genes included in the cosmid library are subsequently
cloned and sequenced.
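
The limiting dilution step relies on seeding wells so sparsely that a positive well most likely arose from a single infectious unit. A minimal Python sketch of the usual reasoning follows. It assumes that the number of infectious units per well follows a Poisson distribution, which is a standard modeling assumption and is not stated in this disclosure; the function names and the mean values are hypothetical.

```python
import math


def fraction_positive(mean_units_per_well: float) -> float:
    """Expected fraction of wells receiving at least one infectious unit."""
    return 1.0 - math.exp(-mean_units_per_well)


def probability_clonal(mean_units_per_well: float) -> float:
    """P(exactly one infectious unit | well is positive) under a Poisson model."""
    lam = mean_units_per_well
    if lam <= 0:
        raise ValueError("mean_units_per_well must be positive")
    return lam * math.exp(-lam) / (1.0 - math.exp(-lam))


# Hypothetical dilutions, expressed as mean infectious units seeded per well.
for lam in (1.0, 0.3, 0.1):
    print(
        f"mean {lam:>4}: {fraction_positive(lam):.1%} of wells positive, "
        f"{probability_clonal(lam):.1%} of positive wells clonal"
    )
```

Under this model, lowering the mean number of units per well trades yield (fewer positive wells) for confidence that each positive well is clonal.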



EXAMPLE 23 
Screening Assays for Pharmaceutical Agents of Interest 

The present disclosure provides a virus that is involved in the causation or progression of
certain diseases, such as KS or multiple sclerosis, and therefore provides an animal model and assays
directed to identifying potential pharmaceutical agents that inhibit the biological activity of the virus.
Drug screening assays which determine whether or not a drug has activity against the virus can
include incubating a compound to be evaluated for use in treatment of the condition with cells which
express the RRV or JMHV proteins or peptides, and determining the effect of the compound on the
activity of the virus. In vitro assays in which the virus is maintained in suitable cell culture are
preferred, though in vivo animal models would also be effective.

In vitro assays include infecting cells such as rhesus or Japanese macaque fibroblasts,
neuronal cells, peripheral blood leukocytes or susceptible T cell lines such as MT-4 with the agent of
interest in the presence of varying concentrations of compounds targeted against viral replication.
These compounds include, but are not limited to, nucleoside analogs, chain terminators, antisense
oligonucleotides and random polypeptides (Asada et al., J. Clin. Microbiol. 27:2204, 1989; Kikuta
et al., Lancet 7:861, 1989). Infected cultures and their supernatants can be assayed for the total
amount of virus, including the presence of the viral genome, by quantitative PCR, by dot blot assays,
or by using immunologic methods. For example, a culture of susceptible cells could be infected with
the RRV or JMHV in the presence of various concentrations of drug, fixed on slides after a period of
days, and examined for viral antigen by indirect immunofluorescence with monoclonal antibodies to
viral polypeptides (Kikuta et al., supra). Alternatively, chemically adhered MT-4 cell monolayers can
be used for an infectious agent assay using indirect immunofluorescent antibody staining to search for
focus reduction (Higashi, J. Clin. Microbiol. 27:2204, 1989, incorporated by reference).
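
To make the focus reduction readout concrete, the following Python sketch shows one common way such counts could be summarized: percent reduction relative to an untreated control, followed by log-linear interpolation of the concentration giving 50% reduction. The function names, the interpolation method, and all counts and concentrations are hypothetical illustrations and are not taken from this disclosure or from the cited references.

```python
import math
from typing import Optional, Sequence


def percent_reduction(foci_treated: float, foci_control: float) -> float:
    """Percent reduction in foci relative to the untreated control."""
    return 100.0 * (1.0 - foci_treated / foci_control)


def interpolate_ic50(
    concentrations: Sequence[float], reductions: Sequence[float]
) -> Optional[float]:
    """Log-linear interpolation of the concentration giving 50% reduction.

    Concentrations must be positive and in ascending order. Returns None
    when the 50% level is not bracketed by two adjacent data points.
    """
    points = list(zip(concentrations, reductions))
    for (c_lo, r_lo), (c_hi, r_hi) in zip(points, points[1:]):
        if r_lo <= 50.0 <= r_hi and r_hi != r_lo:
            frac = (50.0 - r_lo) / (r_hi - r_lo)
            log_c = math.log10(c_lo) + frac * (math.log10(c_hi) - math.log10(c_lo))
            return 10.0 ** log_c
    return None


# Hypothetical data: foci counted in an untreated control and at four drug levels.
control_foci = 200
concentrations_um = [0.1, 1.0, 10.0, 100.0]
treated_foci = [180, 140, 60, 10]

reductions = [percent_reduction(f, control_foci) for f in treated_foci]
ic50_um = interpolate_ic50(concentrations_um, reductions)
print(reductions, ic50_um)
```

In practice a fitted dose-response curve with replicate wells would usually be preferred over two-point interpolation; the sketch only shows how the raw focus counts relate to a potency estimate.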

As an alternative to whole cell in vitro assays, purified enzymes isolated from the RRV or
JMHV can be used as targets for rational drug design to determine the effect of the potential drug on
enzyme activity, such as thymidylate synthase or DNA polymerase. The genes for these two RRV
enzymes are provided herein. A measure of enzyme activity indicates an effect on the infectious
agent itself. Drug screens using herpes viral products are known and have been previously described
in EP 0514830 (herpes proteases) and WO 94/04920 (UL13 gene product).

In particular embodiments, this invention provides an assay for screening anti-KS or anti-MS
chemotherapeutics. Infected cells can be incubated in the presence of a chemical agent that is a
potential chemotherapeutic against KS (e.g., acyclo-guanosine). The level of virus in the cells is then
determined after several days by IFA for antigens or Southern blotting for viral genome or Northern
blotting for mRNA and compared to control cells. This assay can quickly screen large numbers of
chemical compounds that may be useful against KS. This invention also provides an assay system
that is employed to identify drugs or other molecules capable of binding to the DNA molecule or
proteins, either in the cytoplasm or in the nucleus, thereby inhibiting or potentiating transcriptional
activity. This assay would be useful in the development of drugs that are specific against particular
cellular activity, or that would potentiate such activity, in time or in level of activity. Also included
are drugs identified by this assay which have an anti-viral activity, and an effect against conditions
associated with RRV infection, such as KS.

Similarly, this invention provides an assay for screening anti-multiple sclerosis (MS)
therapeutics. Infected cells can be incubated in the presence of a chemical agent that is a potential
anti-inflammatory agent or other agent of use in treating MS. The level of virus in the cells is then
determined by IFA for antigens, Southern blotting for the viral genome, Northern blotting for mRNA,
or PCR, and compared to control cells. This assay can quickly screen large numbers of agents that
may be useful in the treatment of MS. This invention also provides an assay system that is employed
to identify drugs or other molecules capable of binding to the DNA molecule or proteins, either in the
cytoplasm or in the nucleus, thereby inhibiting or potentiating transcriptional activity. This assay
would be useful in the development of drugs that are specific against particular cellular activity, or
that would potentiate such activity, in time or in level of activity. Also included are drugs identified
by this assay which have an anti-viral activity, and an effect against conditions associated with JMHV
infection, such as MS.

EXAMPLE 24 

Generating Animal Models

Animal models are useful for resolving a number of fundamental problems of infectious
diseases that include, but are not limited to, determinants of virulence of the organism, mechanisms
of host resistance, mechanisms of pathogenicity, establishment and regulation of chronic infection,
and antimicrobial and chemotherapeutic actions of drugs on infectious agents. Variables that are
commonly manipulated to address fundamental problems include, but are not limited to, the strain of
infectious agent, the infecting dose of infectious agent and the route of administration of the
infectious agent, the species or subspecies of animal, the age of animal, and the genetic background
of the animal (Viral Pathogenesis, N. Nathanson, Lippincott-Raven, Philadelphia, 1997).

In an embodiment in which one or more RRV strains are employed for generating an animal
model, the RRV used may be naturally occurring variant isolates recovered from rhesus macaques
and other non-human primate species, molecular clones generated from these naturally occurring
variant isolates and recombinant viruses with introduced mutations, deletions or recombined genomes
designed to address function of specific genes.

By manipulating the infecting dose and route of RRV administration, virus-host interactions
dependent upon dose and tissue or organ-specific disease manifestations can be explored. Thus, the
present invention includes various doses of RRV administered by oral, inhalation, intratracheal,
intravaginal, intrarectal and parenteral routes including, but not limited to, intravenous, intraarterial,
intradermal, subcutaneous, intramuscular, intraperitoneal and organ-specific administration routes
such as intracerebral and intraocular administration.

Many disease manifestations with a given infectious agent are highly influenced by age and
species or subspecies of the host and the particular genetic makeup of the host. The present
disclosure provides a virus that is involved in the causation or progression of certain diseases, such as
KS, in the rhesus macaque, but is also useful for the study of and discovery of disease manifestations
that are host species, age and genetic background dependent. In particular embodiments, one skilled
in the art may vary the species of animal to which the RRV is administered to produce or discover a
particular disease manifestation, or similarly vary the genetic background of the animal to produce or
discover a particular disease manifestation, even including the use of genetically engineered animals.

In another embodiment, in which one or more JMHV strains are employed for generating an
animal model, the JMHV used may be naturally occurring variant isolates recovered from Japanese
macaques and other non-human primate species, molecular clones generated from these naturally
occurring variant isolates and recombinant viruses with introduced mutations, deletions or
recombined genomes designed to address function of specific genes.

By manipulating the infecting dose and route of JMHV administration, virus-host
interactions dependent upon dose and tissue or organ-specific disease manifestations can similarly be
explored. Thus, the present invention includes various doses of JMHV administered by oral,
inhalation, intratracheal, intravaginal, intrarectal and parenteral routes including, but not limited to,
intravenous, intraarterial, intradermal, subcutaneous, intramuscular, intraperitoneal and organ-specific
administration routes such as intracerebral and intraocular administration.

Many disease manifestations with a given infectious agent are highly influenced by age and
species or subspecies of the host and the particular genetic makeup of the host. The present
disclosure provides a virus that is involved in the causation or progression of certain diseases, such as
MS, in the Japanese macaque, but is also useful for the study of and discovery of disease
manifestations that are host species, age and genetic background dependent. In particular
embodiments, one skilled in the art may vary the species of animal to which the JMHV is
administered to produce or discover a particular disease manifestation, or similarly vary the genetic
background of the animal to produce or discover a particular disease manifestation, even including
the use of genetically engineered animals.

EXAMPLE 25

Pharmaceutical Compositions and Modes of Administration 

Various delivery systems for administering pharmaceutical proteins from the RRV or JMHV
include encapsulation in liposomes, microparticles, microcapsules, expression by recombinant cells,
receptor-mediated endocytosis (see Wu and Wu, J. Biol. Chem. 1987, 262:4429-32), and construction
of a therapeutic nucleic acid (such as an anti-sense molecule) as part of a retroviral or other vector.
Methods of introduction include, but are not limited to, intradermal, intramuscular, intraperitoneal,
intravenous, subcutaneous, intranasal, and oral routes. The compounds may be administered by any
convenient route, for example by infusion or bolus injection, by absorption through epithelial or
mucocutaneous linings (e.g., oral mucosa, rectal and intestinal mucosa, etc.) and may be administered
together with other biologically active agents. Administration can be systemic or local. In addition,
the pharmaceutical compositions may be introduced into the central nervous system by any suitable
route, including intraventricular and intrathecal injection; intraventricular injection may be facilitated
by an intraventricular catheter, for example, attached to a reservoir, such as an Ommaya reservoir.

The use of liposomes as a delivery vehicle is another delivery method of the present
invention. The liposomes fuse with the target site and deliver the contents of the lumen
intracellularly. The liposomes are maintained in contact with the target cells for a sufficient time for
fusion to occur, using various means to maintain contact, such as isolation and binding agents.
Liposomes may be prepared with purified proteins or peptides that mediate fusion of membranes,
such as Sendai virus or influenza virus. The lipids may be any useful combination of known
liposome forming lipids, including cationic lipids, such as phosphatidylcholine. Other potential lipids
include neutral lipids, such as cholesterol, phosphatidyl serine, phosphatidyl glycerol, and the like.
For preparing the liposomes, the procedure described by Kato et al. (J. Biol. Chem. 1991, 266:3361)
may be used.

The present invention also provides pharmaceutical compositions which include a 
therapeutically effective amount of one or more RRV, or one or more JMHV proteins, or RRV or 
JMHV DNA, alone or with a pharmaceutically acceptable carrier. 

The pharmaceutical compositions or methods of treatment may be administered in
combination with other therapeutic treatments, such as other antineoplastic or antitumorigenic
therapies.

Administration of Nucleic Acid Molecules

In an embodiment in which one or more RRV or JMHV nucleic acids are employed for
generating an animal model, the analog may be delivered intracellularly (e.g., by expression from a
nucleic acid vector or by receptor-mediated mechanisms). In a specific embodiment where the
therapeutic molecule is a nucleic acid, administration may be achieved by an appropriate nucleic acid
expression vector which is administered so that it becomes intracellular, e.g., by use of a retroviral
vector (see U.S. Patent No. 4,980,286), or by direct injection, or by use of microparticle
bombardment (e.g., a gene gun; Biolistic, Dupont), or coating with lipids or cell-surface receptors or
transfecting agents, or by administering it in linkage to a homeobox-like peptide which is known to
enter the nucleus (see, e.g., Joliot et al., Proc. Natl. Acad. Sci. USA 1991, 88:1864-8). Alternatively,
the nucleic acid can be introduced intracellularly and incorporated within host cell DNA for
expression, by homologous recombination.

The vector pCDNA is an example of a method of introducing the foreign cDNA into a cell
under the control of a strong viral promoter (CMV) to drive the expression. However, other vectors
can be used. Other retroviral vectors (such as pRETRO-ON, Clontech) also use this promoter but
have the advantages of entering cells without any transfection aid, integrating into the genome of
target cells ONLY when the target cell is dividing (as cancer cells do, especially during first
remissions after chemotherapy) and being regulated. It is also possible to turn on the expression of
the RRV or JMHV nucleic acid by administering tetracycline when these plasmids are used. Hence
these plasmids can be allowed to transfect the cells, and a course of tetracycline can then be
administered with a course of chemotherapy to achieve better cytotoxicity.

Other plasmid vectors, such as pMAM-neo (also from Clontech) or pMSG (Pharmacia) use
the MMTV-LTR promoter (which can be regulated with steroids) or the SV40 late promoter (pSVL,
Pharmacia) or metallothionein-responsive promoter (pBPV, Pharmacia) and other viral vectors,
including retroviruses. Examples of other viral vectors include adenovirus, AAV (adeno-associated
virus), recombinant HSV, poxviruses (vaccinia) and recombinant lentivirus (such as HIV). All these
vectors achieve the basic goal of delivering into the target cell the cDNA sequence and control
elements needed for transcription. The present invention includes all forms of nucleic acid delivery,
including synthetic oligos, naked DNA, plasmid and viral, integrated into the genome or not.

Also contemplated are inhibitory nucleic acid therapeutics which can inhibit the activity of
RRV, for example in a subject with KS or other diseases associated with RRV infection. Similarly,
inhibitory nucleic acid therapeutics are provided which can inhibit the activity of JMHV, for example
in a subject with MS or other diseases associated with JMHV infection. Inhibitory nucleic acids may
be single-stranded nucleic acids, which can specifically bind to a complementary nucleic acid
sequence. By binding to the appropriate target sequence, an RNA-RNA, a DNA-DNA, or RNA-DNA
duplex or triplex is formed. These nucleic acids are often termed "antisense" because they are
usually complementary to the sense or coding strand of the gene, although recently approaches for
use of "sense" nucleic acids have also been developed. The term "inhibitory nucleic acids" as used
herein, refers to both "sense" and "antisense" nucleic acids.
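
The complementarity described above can be illustrated with a short Python sketch that derives an antisense DNA oligonucleotide as the reverse complement of a sense-strand segment. The function name and the 12-base target are hypothetical and are not drawn from the RRV or JMHV sequences of this disclosure; practical antisense design also involves considerations (length, target accessibility, chemistry) that this sketch does not address.

```python
# Translation table mapping each DNA base to its Watson-Crick partner.
_DNA_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def antisense_dna(sense_strand: str) -> str:
    """Return the reverse complement (5'->3') of a sense-strand DNA segment."""
    seq = sense_strand.upper()
    unexpected = set(seq) - set("ACGT")
    if unexpected:
        raise ValueError(f"unexpected characters in sequence: {sorted(unexpected)}")
    return seq.translate(_DNA_COMPLEMENT)[::-1]


# Hypothetical 12-base segment of a coding (sense) strand, written 5'->3'.
target_sense = "ATGGCTGAAACC"
antisense_oligo = antisense_dna(target_sense)
print(antisense_oligo)
```

Applying the function twice returns the original segment, which is a quick sanity check that the oligonucleotide is exactly complementary to its target.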

By binding to the target nucleic acid, the inhibitory nucleic acid can inhibit the function of
the target nucleic acid. This could, for example, be a result of blocking DNA transcription, processing
or poly(A) addition to mRNA, DNA replication, translation, or promoting inhibitory mechanisms of
the cells, such as promoting RNA degradation. Inhibitory nucleic acid methods therefore encompass
a number of different approaches to altering expression of RRV or JMHV genes. Cleavage, and
therefore inactivation, of the target nucleic acids may be effected by attaching a substituent to the
inhibitory nucleic acid which can be activated to induce cleavage reactions. The substituent can be
one that effects either chemical or enzymatic cleavage. Alternatively, cleavage can be induced by
the use of ribozymes or catalytic RNA. In this approach, the inhibitory nucleic acids would include
either naturally occurring RNA (ribozymes) or synthetic nucleic acids with catalytic activity.

The inhibitory nucleic acid therapies can be used to target nucleic acids to sequences of
RRV or JMHV for use in treating conditions caused by the RRV or JMHV, or proteins of the RRV (or
JMHV), for example for treating KS or KS-like syndromes, or MS or MS-like syndromes.

Administration of Antibodies 

Therapeutic, intravenous, polyclonal or monoclonal antibodies have been used as a mode of
passive immunotherapy of herpesviral diseases, such as infection with CMV. Immune globulin from
subjects previously infected with the RRV and bearing a suitably high titer of antibodies against the
virus can be given in combination with antiviral agents (e.g., ganciclovir), or in combination with
other modes of immunotherapy that are currently being evaluated for the treatment of KS, which are
targeted to modulating the immune response (i.e., treatment with copolymer-1, anti-idiotypic
monoclonal antibodies, T cell "vaccination"). In one embodiment, antibodies specific for an epitope
expressed on cells infected with the RRV are utilized and can be obtained as described above.
Similarly, immune globulin from subjects previously infected with the JMHV and bearing a suitably
high titer of antibodies against the virus can be given in combination with antiviral agents (e.g.,
ganciclovir), or in combination with other modes of immunotherapy that are currently being
evaluated for the treatment of MS, which are targeted to modulating the immune response. In one
embodiment, antibodies specific for an epitope expressed on cells infected with the JMHV, obtained
as described above, are utilized.

The present invention also provides pharmaceutical compositions which include a
therapeutically effective amount of the antibody, and a pharmaceutically acceptable carrier or
excipient.

EXAMPLE 26

Vaccines

This invention provides substances suitable for use as vaccines for the prevention of diseases
associated with RRV infection, such as KS, and diseases associated with JMHV infection, or
infection by related viruses which cause disease such as MS, and methods for administering them.
Particular vaccines are directed against RRV or related viruses, and may include antigens obtained
from RRV or those related viruses, or the vaccines are directed against JMHV or related viruses, and
may include antigens obtained from JMHV or its related viruses. In one embodiment, the vaccine
contains attenuated RRV or JMHV, or related viruses found in humans. In another embodiment, the
vaccine contains killed RRV or JMHV. In another embodiment, the vaccine contains a nucleic acid
vector encoding RRV or JMHV, or a surface protein of RRV or JMHV, such as a capsid protein. In
another embodiment, the vaccine is a subunit vaccine containing an RRV or JMHV subunit, such as
glycoprotein B, major capsid protein, or other gene products found to elicit appropriate humoral
and/or cell mediated immune responses.

This invention also provides a method of vaccinating a subject against Kaposi's sarcoma and
lymphoproliferative disorders, comprising administering to a susceptible subject an effective amount
of the peptide or polypeptide encoded by an isolated DNA molecule encoding a polypeptide or
combination of polypeptides expressed by the DNA molecule, and a suitable acceptable carrier. In
one embodiment, naked DNA is administered to the subject in an effective amount to vaccinate the
subject against Kaposi's sarcoma and lymphoproliferative disorders, or other disease associated with
RRV infection.

In another embodiment, a method is provided for vaccinating a subject against multiple
sclerosis, comprising administering to a susceptible subject an effective amount of the peptide or
polypeptide encoded by an isolated DNA molecule encoding a polypeptide or combination of
polypeptides expressed by the DNA molecule, and a suitable acceptable carrier. In one embodiment,
naked DNA is administered to the subject in an effective amount to vaccinate the subject against
multiple sclerosis, or other disease associated with JMHV infection.

The vaccine can be made using synthetic peptide or recombinantly-produced polypeptide
described above as antigen. Typically, a vaccine will include from about 1 to 50 micrograms of
antigen, for example from about 15 to about 45 micrograms. Typically, the vaccine is formulated so
that a dose includes about 0.5 milliliters. The vaccine may be administered by any route known in
the art, for example parenteral, subcutaneous or intramuscular.

There are a number of strategies for amplifying an antigen's effectiveness, particularly as
related to the art of vaccines. For example, cyclization of a peptide can increase the peptide's
antigenic and immunogenic potency. See U.S. Pat. No. 5,001,049. More conventionally, an antigen
can be conjugated to a suitable carrier, usually a protein molecule. This procedure can allow multiple
copies of an antigen, such as a peptide, to be conjugated to a single larger carrier molecule.
Additionally, the carrier may possess properties which facilitate transport, binding, absorption or
transfer of the antigen.

For parenteral administration, such as subcutaneous injection, examples of suitable carriers
are the tetanus toxoid, the diphtheria toxoid, serum albumin and lamprey, or keyhole limpet,
hemocyanin because they provide the resultant conjugate with minimum genetic restriction.
Conjugates including these universal carriers can function as T cell clone activators in individuals
having very different gene sets. The conjugation between a peptide and a carrier can be
accomplished using one of the methods known in the art. Specifically, the conjugation can use
bifunctional cross-linkers as binding agents as detailed, for example, by Means and Feeney, "A recent
review of protein modification techniques," Bioconjugate Chem. 1:2-12 (1990).

Vaccines against RRV can be made from the RRV envelope glycoproteins, and vaccines
against JMHV can be made from the JMHV envelope glycoproteins. These proteins can be purified
and used for vaccination (Lasky, L. A., 1990, J. Med. Virol. 31:59). MHC-binding peptides from cells
infected with the human herpesvirus can be identified for vaccine candidates per the methodology of
Marloes, et al., 1991, Eur. J. Immunol. 21:2963-2970. The RRV or JMHV antigen may be combined
or mixed with various solutions and other compounds as is known in the art. For example, it may be
administered in water, saline or buffered vehicles with or without various adjuvants or
immunodiluting agents. Examples of such adjuvants or agents include aluminum hydroxide,
aluminum phosphate, aluminum potassium sulfate (alum), beryllium sulfate, silica, kaolin, carbon,
water-in-oil emulsions, oil-in-water emulsions, muramyl dipeptide, bacterial endotoxin, lipid X,
Corynebacterium parvum (Propionibacterium acnes), Bordetella pertussis, polyribonucleotides,
sodium alginate, lanolin, lysolecithin, vitamin A, saponin, liposomes, levamisole, DEAE-dextran,
blocked copolymers or other synthetic adjuvants. Such adjuvants are available commercially from
various sources, for example, Merck Adjuvant 65 (Merck and Company, Inc., Rahway, N.J.) or
Freund's Incomplete Adjuvant and Complete Adjuvant (Difco Laboratories, Detroit, Mich.). Other
suitable adjuvants are Amphigen (oil-in-water), Alhydrogel (aluminum hydroxide), or a mixture of
Amphigen and Alhydrogel. Only aluminum is approved for human use.

The proportion of antigen and adjuvant can be varied over a broad range so long as both are
present in effective amounts. For example, aluminum hydroxide can be present in an amount of
about 0.5% of the vaccine mixture (Al2O3 basis). On a per-dose basis, the amount of the antigen can
range from about 0.1 ug to about 100 ug protein per subject, for example about 1 ug to about 50 ug
per dose, or about 15 ug to about 45 ug. A suitable dose size is about 0.5 ml. Accordingly, a dose for
intramuscular injection, for example, would comprise 0.5 ml containing 45 ug of antigen in
admixture with 0.5% aluminum hydroxide. After formulation, the vaccine may be incorporated into
a sterile container which is then sealed and stored at a low temperature, for example 4°C, or it may
be freeze-dried. Lyophilization permits long-term storage in a stabilized form.
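
The per-dose figures above reduce to simple arithmetic. As a minimal sketch (the function and constant names are illustrative assumptions, not part of the specification), the following computes the antigen concentration implied by a dose and checks a dose against the stated range of about 0.1 ug to about 100 ug per subject:

```python
# Illustrative helper names; only the numeric values come from the text above.
ANTIGEN_RANGE_UG = (0.1, 100.0)  # broad per-subject antigen range stated in the text
DOSE_VOLUME_ML = 0.5             # suitable dose size stated in the text


def antigen_concentration_ug_per_ml(antigen_ug, volume_ml=DOSE_VOLUME_ML):
    """Return the antigen concentration (ug/ml) of a single dose."""
    if volume_ml <= 0:
        raise ValueError("dose volume must be positive")
    return antigen_ug / volume_ml


def within_stated_range(antigen_ug, limits=ANTIGEN_RANGE_UG):
    """True if the antigen amount lies inside the stated per-subject range."""
    low, high = limits
    return low <= antigen_ug <= high


# Example from the text: 45 ug of antigen in a 0.5 ml intramuscular dose.
example_concentration = antigen_concentration_ug_per_ml(45.0)
print(example_concentration)  # 90.0
```

The 45 ug example dose therefore corresponds to 90 ug of antigen per ml of formulated vaccine.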

The vaccines may be administered by any conventional method for the administration of
vaccines, including oral administration and parenteral (e.g., subcutaneous or intramuscular)
injection. Intramuscular administration is preferred. The treatment may consist of a single dose of
vaccine or a plurality of doses over a period of time. Also, the antigen could be a component of a
recombinant vaccine which is adaptable for oral administration. Vaccines of the invention may be
combined with other vaccines for other diseases to produce multivalent vaccines. A pharmaceutically
effective amount of the antigen can be employed with a pharmaceutically acceptable carrier such as
a protein or diluent useful for the vaccination of mammals, particularly humans. Other vaccines may
be prepared according to methods well-known to those skilled in the art.

Those of skill will readily recognize that it is only necessary to expose a mammal to
appropriate epitopes in order to elicit effective immunoprotection. The epitopes are typically
segments of amino acids which are a small portion of the whole protein. Using recombinant genetics,
it is routine to alter a natural protein's primary structure to create derivatives embracing epitopes that
are identical to or substantially the same as (immunologically equivalent to) the naturally occurring
epitopes. Such derivatives may include peptide fragments, amino acid substitutions, amino acid
deletions and amino acid additions of the amino acid sequence for the viral polypeptides from the
human herpesvirus. For example, it is known in the protein art that certain amino acid residues can
be substituted with amino acids of similar size and polarity without an undue effect upon the
biological activity of the protein. The human herpesvirus proteins have significant tertiary structure
and the epitopes are usually conformational. Thus, modifications should generally preserve
conformation to produce a protective immune response.

EXAMPLE 27 

Peptide Synthesis and Purification

The peptides provided by the present invention can be chemically synthesized by any of
a number of manual or automated methods of synthesis known in the art. For example, solid phase
peptide synthesis (SPPS) is carried out on a 0.25 millimole (mmole) scale using an Applied
Biosystems Model 431A Peptide Synthesizer and using 9-fluorenylmethyloxycarbonyl (Fmoc)
amino-terminus protection, coupling with dicyclohexylcarbodiimide/hydroxybenzotriazole or
2-(1H-benzo-triazol-1-yl)-1,1,3,3-tetramethyluronium hexafluorophosphate/hydroxybenzotriazole
(HBTU/HOBT), and using p-hydroxymethylphenoxymethyl-polystyrene (HMP) or Sasrin resin for
carboxyl-terminus acids or Rink amide resin for carboxyl-terminus amides.

Fmoc-derivatized amino acids are prepared from the appropriate precursor amino acids
by tritylation with triphenylmethanol in trifluoroacetic acid, followed by Fmoc derivatization as
described by Atherton et al. (Solid Phase Peptide Synthesis, IRL Press: Oxford, 1989).

Sasrin resin-bound peptides are cleaved using a solution of 1% TFA in dichloromethane
to yield the protected peptide. Where appropriate, protected peptide precursors are cyclized between
the amino- and carboxyl-termini by reaction of the amino-terminal free amine and carboxyl-terminal
free acid using diphenylphosphorylazide in nascent peptides wherein the amino acid sidechains are
protected.

HMP or Rink amide resin-bound products are routinely cleaved and protected
sidechain-containing cyclized peptides deprotected using a solution comprised of trifluoroacetic acid
(TFA), optionally also comprising water, thioanisole, and ethanedithiol, in ratios of 100 : 5 : 5 : 2.5,
for 0.5 - 3 hours at room temperature.

Crude peptides are purified by preparative high pressure liquid chromatography
(HPLC), for example using a Waters Delta-Pak C18 column and gradient elution with 0.1% TFA in
water modified with acetonitrile. After column elution, acetonitrile is evaporated from the eluted
fractions, which are then lyophilized. The identity of each product so produced and purified may be
confirmed by fast atom bombardment mass spectroscopy (FABMS) or electrospray mass
spectroscopy (ESMS).

EXAMPLE 27

Spontaneous Demyelinating Encephalomyelitis in Japanese Macaques

Multiple sclerosis (MS) is a chronic, debilitating inflammatory disease limited to central
nervous system (CNS) white matter for which there are very few effective treatments. The disease
may not be etiologically homogeneous, but rather a complex set of diseases that have in common
pathogenic mechanisms that involve genetically predisposed individuals, infectious agents as
initiators, and diverse mechanisms of inflammatory white matter destruction (Hafler, J Clin Invest
104:527-529, 1999). Genetic studies in families with MS-affected members have revealed that MS is
a complex trait, that the contribution of individual genes to susceptibility is probably small, and that
differences are possible between familial and sporadic forms of the disease (Kalman and Lublin,
Biomed Pharmacother 53:358-370, 1999).

The inflammatory response in MS is characterized by T lymphocyte-mediated
demyelination of the CNS and by autoimmune responses to myelin proteins such as myelin basic
protein (MBP), proteolipid protein (PLP), and myelin oligodendrocyte glycoprotein (MOG) (e.g.
Wekerle, Curr Opin Neurobiol 3:779-784, 1993). No aspect of the inflammatory response in MS has
been identified as being specific for the disease (Prineas and McDonald, In: Graham DI, Lantos PL
(eds.), Greenfield's Neuropathology, 6th Edn. Oxford Univ. Press, New York, pp. 813-896, 1997).
The lesions are vasocentric and initially characterized by leukocyte investment of small blood
vessels, endothelial cell activation, edema and myelin swelling. Secondarily, there is macrophage
and microglial cell activation and selective myelin destruction. Chronic-active lesions are
characterized by ongoing myelin breakdown, and foamy macrophages are actively engaged in
removing and digesting myelin. Oligodendroglial cells are typically reduced in number, astrocytes in
the demyelinated zone are reactive and axons are, for the most part, preserved. Chronic plaques are,
in general, quiescent glial scars. There is extensive variability in the clinical course of the disease
and heterogeneity in the distribution and character of the lesions, and there are five, and possibly six,
clinical syndromes that seem to belong to the narrow group of immune-mediated demyelinating
diseases considered to be MS or MS variants (Hickey, J Neuroimmunol 98:37-44, 1999).

Epidemiological evidence strongly supports the hypothesis that MS may be initiated by a
virus infection. However, no single virus has been consistently found in MS lesions and no fewer
than 16 infectious agents have been associated with the disease (Johnson, Viral Infections of the
Nervous System, 2nd Ed., Lippincott-Raven, Philadelphia, 1998). The JMHV, and related human
viruses, are believed to be involved in the pathogenesis of this constellation of diseases.

Determining the mechanisms by which infections trigger autoimmune sequelae is an area of
great interest, and may provide important clues about the pathogenesis and regulation of autoimmune
diseases. While there is no evidence of an active cytolytic viral infection in the CNS in MS, viruses
may participate in the pathogenesis of MS in one or more ways. First, viruses are postulated to share
sequence homologies with myelin epitopes and, through the mechanism of molecular mimicry,
activate autoreactive T cells (e.g. see Fujinami and Oldstone, Science 230:1043-1045, 1985). In
general, there is little direct evidence to support this mechanism. Second, viral antigens expressed in
the CNS as a result of chronic, non-cytolytic infection may serve as targets for virus-specific T cells
entering into the CNS, resulting in non-specific inflammatory, or bystander, damage to myelin and
myelin-producing oligodendrocytes (Miller and Gerety, Semin Virol 1:263-272, 1990). Third, CNS
viral infections that initially cause cytolytic or bystander damage to myelin secondary to
virus-specific T cells may promote induction of autoimmunity to the myelin proteins released by viral
damage to the CNS by epitope spreading (Lehmann et al., Nature 358:155-157, 1992).

Several virus-induced and autoimmune models have been used to study the underlying
mechanisms of myelin destruction in MS. These models include demyelinating diseases induced by
infection with mouse hepatitis virus, Sindbis virus, Semliki Forest virus, herpesvirus, or Theiler's
murine encephalomyelitis virus, as well as immunization with CNS autoantigens. Although these
models have provided a wealth of knowledge about mechanisms of inflammatory demyelination and
are attractive parallel models for human MS, their direct relevance to MS is unknown.

Demyelinating encephalomyelitis with morphologic features similar to multiple sclerosis
was first observed in the Oregon Regional Primate Research Center Japanese macaque troop in July,
1986. The troop originated from 55 founder animals imported from Japan in 1965 and has been
maintained as a closed group. No additional animals have been introduced (Eaton, Adv Behav Biol
11:287-297, 1974). The incidence of the disease peaked the following year, affecting 3% of the
population. Subsequently, approximately 1% of the population developed demyelinating
encephalomyelitis annually. The age of the affected animals ranges from 97 days to 20 years, with
approximately equal sex distribution (20 males and 18 females). The mean age at disease onset is 5
years, 22 days, and young animals are over-represented among the 38 cases identified to date (median
age = 3 years, 284 days).

Clinically, the disease is characterized by acute onset paresis or paralysis involving one or
more limbs in robust, healthy animals that lack evidence of chronic disease. The severity is variable,
ranging from limb weakness to tetraparesis. Cerebrospinal fluid analysis generally reveals moderate
to marked pleocytosis dominated by lymphocytes and elevated protein (Table 2). Regardless of the
initial severity, the disease is uniformly progressive and necessitates euthanasia of the animal.
Treatment with high doses of corticosteroids delays progression for several days to weeks. However,
all animals treated with corticosteroids to date have manifested relapsing, progressive disease.

Table 2

Japanese Macaque Demyelinating Encephalomyelitis, CSF Analysis


IDNo. 


Glucose 
(mg%) 


Total Protein 
(mg/dL) 


WBC(DL)
PMN(%)
100
195


12 


100 


0 


19400 


nd 


144 


341 


93 


7 














Range 
(Mean) 


33-58 
(48) 


11-195 
(75) 


0-2,710 
(260) 


(89) 


(5) 



Pathologically, Japanese macaque demyelinating encephalomyelitis is characterized by
multifocal dull yellow or pale tan plaques that range from 0.2 to 1.0 cm in greatest dimension
distributed primarily in the cerebellar and spinal cord white matter. Gray matter is exquisitely spared.
Most animals lack lesions in the mid- and forebrain. Histologically, the lesions are vasocentric and
vary considerably in age. Acute lesions are dominated by perivenous lymphocytic cuffs that frequently
track perforating venules from pia-arachnoid surfaces into adjacent white matter. The infiltrating
lymphocytes frequently have large, activated nuclei, as do endothelial cells lining the venules within
the lymphocytic cuffs. The adjacent myelin is edematous, vesiculated and sparsely infiltrated with
lymphocytes and macrophages. Occasionally, small focal hemorrhages are present in the edematous
myelin. Subacute lesions are densely infiltrated with macrophages and lymphocytes and there is
conspicuous myelin damage and phagocytosis of myelin debris by macrophages. Necrosis and
microcystic spaces are dominant features in some subacute lesions. Chronic lesions consist of crisp
plaques composed principally of fibrous astrocytes with foamy macrophages and occasional
lymphocytes entrapped in the meshwork of glial fibers. The propensity for these lesions to occur
only in white matter, their inflammatory demyelinating character and variable age are recognized
features of MS. The acute fulminant course despite the histologic evidence of both acute and chronic
lesions, the presence of necrosis in some subacute lesions and the predilection for the cerebellum and
spinal cord are most consistent with the neuromyelitis optica or Devic's disease variation of MS
(Prineas and McDonald, In: Graham DI, Lantos PL (eds.), Greenfield's Neuropathology, 6th Edn.
Oxford Univ. Press, New York, pp. 813-896, 1997; Raine, In: Davis RL, Robertson DM (eds.),
Textbook of Neuropathology, 3rd Edn. Williams and Wilkins, New York, pp. 627-714, 1997). While
Devic's disease has clinical and pathologic features that are distinct from those of classical MS,
individuals that survive the acute episode often progress to develop a disease course typical of
classical MS, suggesting that the underlying process is similar or identical.

EXAMPLE 28

Isolation of Japanese Macaque Herpesvirus 17792

Attempts to isolate bacteria from cerebrospinal fluid and lesioned brain in these animals
were negative in all cases, as were initial attempts to isolate viruses by inoculating cell cultures with
homogenized brain. Thus, primary tissue explant cultures and cocultivation of leukocytes with target
cells were used to enhance the probability of isolating highly cell-associated agents. Fresh tissue was
dissected from visible spinal cord lesions identified during necropsy examination of Japanese
macaque 17792 on 05/22/95. Spinal cord tissue explant cultures were established from small spinal
cord tissue fragments cultured in T-25 culture flasks in Dulbecco's Modified Eagle Medium
(DMEM, Gibco) supplemented with 20% fetal bovine serum, 1% L-glutamine and 1%
penicillin-streptomycin-neomycin. Spinal cord explant cultures were fed by partial medium
replacement every 7-10 days and passaged at confluence after 30 days in culture. Confluent monolayer
explant cultures were rinsed in calcium- and magnesium-free phosphate-buffered saline, pH 7.4,
digested with trypsin-EDTA at 37°C, washed with growth medium and seeded into T-25 culture flasks.
First passage spinal cord explant cultures were fed by complete medium replacement at 5 to 7 day
intervals and developed cytopathic effects (CPE) 7 days after passage. Cytopathic effects spread
through the first passage spinal cord explant cultures after 14 days in culture and the cultures were
rapidly frozen in liquid N2 and thawed. Cell-free supernatants were clarified by centrifugation,
filtered through a 0.45 micron membrane and stored frozen at -80°C.

Peripheral blood mononuclear cells (PBMC) were isopycnic gradient-purified (Ficoll-Paque,
Pharmacia) from fresh ammonium heparin anti-coagulated blood obtained from Japanese macaque
17792 at necropsy. Gradient-purified PBMC were washed in calcium- and magnesium-free
phosphate-buffered saline and 3 x 10^6 PBMC were cocultured with 1 x 10^5 primary rhesus macaque
fibroblasts in T-25 culture flasks containing DMEM growth medium supplemented with 10% fetal
bovine serum, 1% L-glutamine and 1% penicillin-streptomycin-neomycin. The PBMC-fibroblast
cocultures were fed by partial medium replacement after 3 days in culture and passaged at confluence
at day 9. Confluent PBMC-fibroblast cocultures were rinsed in calcium- and magnesium-free
phosphate-buffered saline, pH 7.4, digested with trypsin-EDTA at 37°C, washed with growth
medium and seeded into T-25 culture flasks. First passage PBMC-fibroblast cultures were fed by
complete medium replacement at 3 to 5 day intervals and exhibited CPE 12 days after passage.
Cytopathic effects spread through the first passage PBMC-fibroblast cultures after 15 days in culture
and the cultures were rapidly frozen in liquid N2 and thawed. Cell-free supernatants were clarified by
centrifugation, filtered through a 0.45 micron membrane and stored frozen at -80°C.

To prepare virus stocks, cell-free filtered supernatants harvested from the spinal cord explant
cultures and the PBMC-fibroblast cocultures were separately inoculated onto primary rhesus
macaque fibroblast cultures. Fibroblast cultures inoculated with spinal cord explant supernatant and
PBMC-fibroblast coculture supernatant developed CPE 5 days post-inoculation. The cultures were
rapidly frozen in liquid N2 and thawed, and cell-free supernatants were clarified by centrifugation,
filtered through a 0.45 micron membrane and stored frozen at -80°C. Using similar techniques, Vero
cells (American Type Culture Collection ATCC CCL-81) were shown to be susceptible to infection
and suitable for propagation of virus.

For transmission electron microscopy, passaged primary spinal cord explant cultures
exhibiting CPE and fibroblast cultures developing CPE following inoculation with cell-free filtered
supernatant from primary spinal cord explant cultures or PBMC-fibroblast cocultures were scraped
free into medium, pelleted at 400 x g, washed in calcium- and magnesium-free phosphate-buffered
saline and suspended in cold Ito and Karnovsky's fixative (2.5% glutaraldehyde, 0.5% picric acid,
1.6% paraformaldehyde, 0.005% ruthenium red) in 0.1 M sodium cacodylate buffer, pH 7.4, for 2
hours. Fixed cells were washed in cacodylate buffer, post-fixed in 1% OsO4 and 0.8% K3Fe(CN)6 in
cacodylate buffer for 1 hour, rinsed in distilled H2O and pre-stained in 2% aqueous uranyl acetate for
1 hour. Fixed and pre-stained cells were dehydrated in a graded series of acetone, embedded in Epon
812 epoxy resin, polymerized at 60°C and sectioned at 60 nm on an MT 5000 ultramicrotome.
Copper grid mounted sections were stained with lead citrate and uranyl acetate and viewed in a
Philips 300 electron microscope. Numerous herpesvirus particles were observed in the primary
spinal cord explant cells and in fibroblasts inoculated with cell-free filtered supernatant from primary
spinal cord explant cultures and the PBMC-fibroblast cocultures.



To retrospectively and prospectively evaluate the relationship between Japanese macaque
herpesvirus (JMHV) infection and encephalomyelitis, animals were identified in the troop that were
actively infected with JM herpesvirus by cocultivating PBMC with rhesus macaque fibroblasts (see
above). The seroconversion rate, using an enzyme-linked immunoassay (ELISA) constructed with
strain 17792 JM herpesvirus, was established as follows. JM herpesvirus strain 17792-infected
primary rhesus macaque fibroblasts were solubilized with 0.5% Nonidet P-40 and 1% sodium
deoxycholate in phosphate-buffered saline, pH 7.2, and clarified in a Beckman SW28 rotor at 23,500
rpm for 1 hour at 4°C. The clarified supernatant was used as antigen for coating 96-well Maxisorp
plates (2 ng/well) (Nunc, Roskilde, Denmark). ELISAs were then performed essentially as described
(Kodama et al., AIDS Res Hum Retroviruses 5:337-343, 1989).

In 1995-1996, 70-75% of the infected animals were less than 7 years of age. Analysis of
samples from these animals revealed that 16% of the animals in the troop had culturable simian
spumaretrovirus (simian foamy virus; SFV) and 8% of the animals in the troop had JM herpesvirus in
peripheral blood leukocytes (PBL). Virus isolation from PBL was repeated for 199 animals in 1997;
SFV was recovered from 11 (5.5%), the JM herpesvirus was recovered from 120 (60.3%), and 5
(2.5%) had dual infections.
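
The 1997 isolation rates can be recomputed directly from the counts. The sketch below is only a worked check of that arithmetic (the variable and function names are illustrative); with 199 animals tested, 11, 120 and 5 animals correspond to roughly 5.5%, 60.3% and 2.5% respectively:

```python
# Counts reported for the 1997 survey of 199 animals (taken from the text above).
TOTAL_ANIMALS = 199
counts = {"SFV": 11, "JM herpesvirus": 120, "dual infection": 5}


def percent(count, total):
    """Percentage of `total` represented by `count`, rounded to one decimal place."""
    return round(100.0 * count / total, 1)


rates = {name: percent(n, TOTAL_ANIMALS) for name, n in counts.items()}
for name, rate in rates.items():
    print(f"{name}: {rate}%")
```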

The number of JMHV-positive animals decreased with age; 97% of animals less than 1 year
of age and 27% of animals greater than 10 years of age were isolation-positive. One hundred percent
of the animals greater than 300 days of age were seropositive for JM herpesvirus. Eight animals
developed demyelinating encephalomyelitis in 1995-1997 and all were seropositive for the JM
herpesvirus, and 6 of 8 were animals previously identified as JM herpesvirus isolation-positive. Two
of the JM herpesvirus-infected animals were co-infected with SFV. SFV was isolated from 1 of 2 JM
herpesvirus isolation-negative animals and no virus was isolated in the remaining animal. Brain and
spinal cord were available for virus isolation from 7 of the 8 animals with demyelinating
encephalomyelitis; MRV was isolated from lesioned brain or spinal cord explant cultures from 3,
both MRV and SFV were recovered from 2, 1 yielded only SFV and virus was not isolated from 1.
Virus was not recovered from explant cultures of normal appearing white matter from the cerebrum
in any animal.



JMHV Sequence 

The Japanese macaque virus isolate was obtained from a lesion that was minced and
co-cultured with primary rhesus fibroblasts. The isolate was then cloned by limiting dilution and a
stock of virus generated from this clone. Total cellular DNA was harvested from virus-infected cells
and the DNA subjected to degenerate PCR for viral DNA polymerase, exactly as described above for
RRV. Once confirmed, a cosmid library of this virus was made from purified viral DNA (as
described for RRV) and then a portion of the protein genes was cloned and sequenced.

Cosmid 3 is an isolate from a Japanese macaque virus library. It contains about 35,000 bp
of viral sequence and 7000 bp of vector. The viral sequence was sub-cloned using EcoRI. The
fragments below are these EcoRI fragments. The terms T7 and SP6 are the names of universally
used primers for sequencing; they sequence opposing ends of the viral fragment. Because the
lengths of the viral inserts vary, and may be quite large, the open reading frame found at one end of
the insert may or may not differ from the open reading frame found at the other end of the insert.
Protein sequences are in the standard single letter code.
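
A deduced amino acid sequence is a conceptual translation of a DNA read in one reading frame, and reads primed from opposite ends of an insert lie on opposite strands. The following is a minimal sketch of that step using the standard genetic code; the function names are illustrative assumptions, and this is not the software used to produce the sequences listed below:

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code with codons enumerated in TCAG order; "*" marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(dna):
    """Return the opposite strand, read 5' to 3'."""
    return dna.upper().translate(COMPLEMENT)[::-1]


def translate(dna, frame=0):
    """Translate `dna` in reading frame 0, 1 or 2 into single-letter amino acid code.

    Codons containing ambiguous bases (for example N) are reported as "X".
    """
    dna = dna.upper()
    codons = (dna[i:i + 3] for i in range(frame, len(dna) - 2, 3))
    return "".join(CODON_TABLE.get(codon, "X") for codon in codons)


# The EcoRI recognition site GAATTC reads as Glu-Phe ("EF") in frame 0.
print(translate("GAATTC"))           # EF
print(reverse_complement("GAATTC"))  # GAATTC (the site is palindromic)
```

Translating a read in all three frames, and again after reverse-complementing it, gives the six candidate frames from which an open reading frame can be picked.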

Cosmid 3 Fragment 7 T7 Sequence 
Orf 21 - Thymidine kinase

DNA sequence (SEQ ID NO:2)

GAATTCATATTGCATATOTCCICT 

TCCGCTCTGAGATATTGCATC 

TACCTGTCGCAAGTAGTTTTOCGTGACT 

OTCCTGTGKXKXJKXGAACTGTTXM 
20 CGAGAAAAGAGATAGGATITGAAACAAATGATC 

GAACCAGCGGAAAAACAACGGTGGCGGACAGTAGGTGTCT 

CGCTGGCITIXntjCT 

GGCCGTTGCA 

Deduced amino acid sequence (SEQ ID NO:3)
ATAAGIGRNI^PWLVGNGSTKPANWIVro 
AHDGDVVYlXTlJSfSSEAHRRIQSRGRKEEKGITQN^ 
QMVQLCJv-QTVSIEDICNMN 

Cosmid 3 Fragment 7 SP6 sequence
Orf 21 - Thymidine kinase

DNA sequence (SEQ ID NO:4) 
35 GAATTGAATGACIXXKIATGGTGCCG 
AATTTTGGACGTGTGTITAT^ 

GGCACCCACGGGAAATOTATCACTTCTGCrCGCGTATACGCCT 

GCTACCTrTTCGTGCAACXjGCCGCCGG 

GAAACGGTAGCACAAAGCCAGCGAATTGGAlTGTrTITC
acx:gttgititixxgctggtt
gaaatcctatctcttttc 

AGTTCXKjAGGCGCACAGGCXKATTCAAAGTCXjA 
CCCAAAACTACITGCGACAGGTAGCGTG^ 

Deduced amino acid sequence (SEQ ID NO:5)
NSMTGMVPQEmTLSCTEPMKFWTCVYSNC^^ 
FRATAAGIGRNIXJPWLVGNGSTKPANWlYro 
FSAHDGDVVVIXTI^SSEAHRRIQSRGRKEEKGITQNY^ 

Cosmid 3 Fragment 5 T7

No ORF associated, but highly similar to the DNA sequence found in and to the right of RRV repeat
unit rDL-B1.

DNA sequence (SEQ ID NO:6)

AGCGAGTCMGTGACfcGAGGAAGCGGAAGAG 

CGCOTTGGCCGATTCATTAATGG^ 

ATAGGGAGACCGGCCTrcAGCAGCTGAA 
20 TCCCCGGGTACCGAGCT(^AATTC£AG 

GCCGCCTAGCTCCTAATGTTTGCCTTGCC^ 

CTCCTAATGTTTGCCTTGCXXjCCT 

TTTGCCITGCXXKXTAGCTCCTAATGTITG 

CCGCCTAGCTCXn'AATGTTTGC^ 
25 TAAGC^CACTAAAATTTAAAAACGTTTGTG 

CIXKJIXjGTGATTTTTTA(XCAACAAA 

TGTCTGTGGTAATTGKX5TTCOCCGAGGGT 



Cosmid 3 Fragment 5 SP6
Orf 17 - a capsid protein

DNA sequence (SEQ ID NO:7)
GAATTCX3CAGCCCCTCTTGAAACACTGATGGC 
35 AGATCX5CACAGA<XTGCTQVAAACAGATAAAGG^ 
TATTTAAAGG<XAGCCAGTCTCC^ 
CATGAGCGC<XTCCCGGATGAC^ 

GCAGAGCAGTCTCGATCACATGCGCAA(XAGGGCCAA(XjCGCCT 
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<XTCGATG<XGGCAA(XXXXX^ 
CCGTCGTACGCGCCGCC^ 

Deduced amino acid sequence (SEQ ID NO:8)
5 EFAAPI^TLMAKAIDAGFIRDRTO 
AIPDDNTITPKSTFLIMVQ 
VAPPFPFQSAF 

Cosmid 3 Fragment 3 T7
Orf 21 Thymidine kinase 



DNA sequence (SEQ ID NO:9), includes frameshift mutations 
15 GRGCGCAGCGAGTCAGTGAG^ 

CT(XCCGCXjCGTTGG<XGATTCATTAATC 

ACTCACTATAGGGAGACCGGCCTCXjAGCAGCTGAAGCn 

TAGAGK3ATCC(XX3GGTACajAGCTCGAATTCCAGATT^ 

CAAAGTTGTCATGAACAGAGTATGATACCAATGGTGGTAGAAA 
20 GAAC^CX3TGACCTTAATGKL^GGTCT 

GATTTTAATTGTTGAOKXjGGAGAACATTTAC^ 

ttatgggcaggtaatgtgaaatgaggctattaaaccac^ 
ttgaaagctacattcaaacgctaaccagcitgga^ 
tttgtgtttgttggtaattacat 
25 tgta(xtggacntgaaatagttctt^ 

Deduced amino acid sequence - region between frameshifts (SEQ ID NO:10)
SIJIACRSTLEDPRVPSSNSRLTHRFLT^ 

EOlKLXJIIJVnD AGEHIJDDTCGLWGNry' GQVMSNEAJKPRAVNWP AIJBSYIQTLTSLESNA^ 

Cosmid 3 Fragment 3 SP6
Orf 24 Unknown function

DNA sequence (SEQ ID NO:11)
35 GAATTCAATACGCTCCAAAAGATAGAGA 
CCAGATCXiACATAAAQNAGT^ 
CGATCX:AGTAAAGCACXJrAAGTTTACACAAOT 
GGTAATTAATACCAAAATCACGTGCCGGTCrcrCAC^ 
CAT(XCCAGGCTCACAAATAACirTGTAATGAAAAAAT^
TCACTGTGAGCGTGTTCTACrCCX^

TAAGCGGGGACATGCTGCACTITATGTTCXKT 

TGAAGCACATTTT(XCGGTTTCGATTGOT 

TCGAAAACCAATACATAGTTAGACGGGGGCGGCGAGACGTT^ 

Deduced amino acid sequence (SEQ ID NO:12)
IQYAPKDRDHNFHTDANQNPDRHKQVHHDHQ 
TKITOISLTGTFESIIDIPRLTNNI^ 
MFAMGNIItC^VKHIFPV^ 

Cosmid 3 Fragment 1 T7
Orf 7 Transport protein

DNA sequence (SEQ ID NO:13)
15 GATCXCATAGAGTCCCTGTTCTGCGGC^ 
GCACTGAGCCGGGATTGTTCG^ 

GCGAAAACAAAACGAGCTGTTTACCAGACTCAACAGCATAT^ 
GAT03CAAAAACCXjG<X^CCC(XTCG^ 

GCAAGCGACGTCATTAAAGACGCACAATATCGCAAAGAACACT 
20 CXIAGGGACGGCTTTAAAAAACTAACAGAG 
AACXKXtCTCTGCATGCXXX3TATGGGGGGG^ 

Deduced amino acid sequence (SEQ ID NO: 14) 
DPffiSJ^CGGLFNSTODTINAISRD*^^ 
25 PATPSEPRTATVAATAASDVIKDAQYKKEQYMKKVARDG 
RVWGGVAYG 



Cosmid 3 Fragment 1 SP6 
No ORF. Similar to sequence to the left of the RRV repeat rDL-B1

DNA sequence (SEQ ID NO:15) 

GAATTCACACCCGAACAAATCTAAAAGCAAAATTACACAAATTAACAAAAA 
TAACACAGTTAAAAAATTCAATACATA 
35 ACAATTTTAAATTCAATTTAAAAATTCACCT 

AAAATCCTTTTAACCATAAAATTAGAACXjGCACT 
GGCAGCTTTGTGTTAAAGACCACGO 
ACTGGTTACGTTAGTTCATCTAAAAGCTTAACXj^ 
ATTGCTCCATTITAAATTAAAATTTAAAATCT
GCGACXjCGGGGCAGGTrGCGTAG
TGCGGTTTACGAGCCATTGTITITG 



Large EcoRI fragments were then further reduced using KpnI.

Cosmid 3 EcoRI Fragment 1 KpnI fragment 1 T7
Orf 10 Unknown protein

DNA sequence (SEQ ID NO:16)

GGTACCCCX^CGACTATAAAAAGAGACA(XCTGCTG^ 
GGTGCXjAGTAAGCT<XXXXXjAro 
GGGCGCTCTCCATTGACGCGT^^ 
15 AGTGTCACGTGTCTATGCGCGACAACG 

CAATGGGAACACCGGTGCGTTTCTITCGCGGCGAG 
AACGGCACGCCACGGTACAGCTCCX^XK5TGTGGGCCGC 
GACACTGGTGAACAGGTCGGAGCTOTGCGTCACGGA 
GCCCCAGCATCGGGAGACTGGTCG 

Deduced amino acid sequence, C-terminus of protein (SEQ ID NO:17)

GTPTTIKRDTLVAAAAPCTVVRVSSADDA 

SMRDNGVHERMSH 

Cosmid 3 EcoRI Fragment 1 KpnI fragment 2 SP6
Orf 9 DNA polymerase

DNA sequence (SEQ ID NO:18)
30 GGTACCGAGCCGCITCCAGACG^ 

GGCGAACCGAGAACXXXTACCAGCGT^^ 

TACGCGAAGGT<XCAGCC£KjC^ 

CACCGCCGGCAGGGCCGCGTGCGGC^ 

AAACGTACGACGTCGCGGAGCATC(XGTCACrrGAAATCACGCT 
35 CTCTCGA(XCT(^GCGA<XGCCTCGTCGCCT 
GGACGCCGTTCXXXXKjITCCT 

CXjCGCGTGCCACGCCCCGCCTAGCGGNCAGAGATGCCAGAACXjG^ 
GCAGCTGGGAGGACCTCAGCGTTC

Deduced amino acid sequence (SEQ ID NO:19)

wsitfQTDnpscmnLX^ 

GRAACGFSTRRVTKKIIJnYDVAEHPV^ 
FVDDHGFTTFGWYSCARATPRIAXRDA^ 

Cosmid 3 EcoRI Fragment 1 KpnI fragment 2 T7
Orf 10 Unknown protein

DNA sequence (SEQ ID NO:20)

GGTACCGTTGGGGTTC^ 

GGGTGACGATGAATAGGGGCTCCGG^^ 

CGAGCXKjCGCrcCXJTGTTGGAAAGATTAACT 

TTGGATGCfTlTITrcGGCGTCGG 
15 CAGGACGGGTCCGGACTGGCCCAAGACTATCT 

TGATTTTCATTTGCGCCG 

Deduced amino acid sequence (SEQ ID NO:21) 
AQMKEYAPGDPNAEIVLGQSGPVIPraTCG 
20 ARGDIAFLVAGVAPEPIJFTVTPTIXLSGCl^ 



Cosmid 3 EcoRI Fragment 1 KpnI fragment 3 SP6
Orf 8 Glycoprotein B

DNA sequence (SEQ ID NO:22)
GGTACCGCAAGGTGGCCACCT^ 
ACCGGTAAGCAAGAGGTCATTCGACCXjCT 
GACCTACCAGTGTTTCAGCTCCATGCXjC^ 
30 CGGACAGGGACTTCACTAACCAGACCXjTGTTCCTG^^ 

aacattcagcgatacttcagtc^gccgct 

aatctacagggt(xgaaccacxxn , caacrgcgagatcgtggaca 

cggaaccgtactcitactttgtca(xx^ 

GCTTAAACXlACTCX3ACXnXjCTCCCT 
35 GTGCTTACAAATTACACCATTGTrGAC^ 
GTTTTCG^GACTOjGGAGAATACAC^^ 
AGTCTGTGCGCTGACGCrCTGGAAAACCTT^ 
GCC^GCTACCACTTTGTGGCCAACGACGTG

Deduced amino acid sequence (SEQ ID NO:23)
YRKVATSVTNT^GWTETAVraKQEVIRPW 
DFTOQTVnXJPVEGLTDMQRYFSQPVLYTIPGWFPGIY^ 
VTAl^DlAnBVSPFm^STCSVADKAEN 
5 VSWKAEDPKSAVCALTLWKTFPRAIQTTBDESQIJ 

Cosmid 3 EcoRI Fragment 1 KpnI fragment 3 T7
Orf 9 DNA polymerase 



DNA sequence (SEQ ID NO:24) 
GGTACXTrCGGCGC^CCTGTC 
CX3TAATGGATTGCTXnTGGGGGT(XGGGACGACT 
TGGACOGTIXXnXKKKX5AGCAGCACGTC^ 
15 CK3GAACGGGATCGTGA(XGGGATCATCC^ 
AATGAGCCHGCAGACGTCTGGTC^^ 
GTGCATTTGTGTGAGGGGGGGCGTC^ 
CATGCTCGATACAGAACGCTOACGCCCGGGGGCGCC 

Deduced amino acid sequence (SEQ ID NO:25)

GAPGRQRSVSSMDFFNPYIXjPRGPRPPSHKCTO 

MIPVTIPFPPTYFENGARGDV 

RCAEV 



Cosmid 3 EcoRI Fragment 1 KpnI fragment 4 T7
Orf 8 Glycoprotein B 

DNA sequence (SEQ ID NO:26) 
30 GGTACCGCXTTAACCTTAAAGAT 

CTTCTTGGTGCGTlTrGNCCTCGGTGCCAGGACACCT 

GTTCGCCGGTGGCOSAGGC^ 

TTTTCXKK^GGTGTGGGCGTCGACGGGCXX 

CTITGGGGTGGTGACGTTTTCGC^ 
35 CCGCAGCAGACGACGCGTCCGGTTAGGTATCA 



Deduced amino acid sequence (SEQ ID NO:27)

IPNRTRRIXRAWV^nttAIGAPVGENVTTC

SATGEIJTRPNI^KTCPGTEXKra

The sequences of the Japanese macaque gamma-2 herpesvirus were compared with those of the rhesus
macaque gamma-2 herpesvirus 17577. The results are shown in Figs. 12-25. Based on a comparison of
2328 amino acid residues from 14 independent regions (average length of 166±71 amino acids), the
percent similarity between the two viruses at the protein level was found to be 94.12%. The range
among the 14 regions was 75.652% to 99.422%. The lowest identity was found in the MIP region of
the two genomes, while the highest identity was found in ORF 21. The calculations are shown below
in Table 3:



Table 3

Calculation of % Similarity and % Identity

Gene Region   Length of Alignment   % similarity   # similar   % identity   # identical
DL6           207                   94.203%        195         91.304%      178
MIP           120                   80.870%        97          75.652%      73
Orf07         226                   98.230%        222         98.230%      218
Orf08         109                   94.495%        103         94.495%      97
Orf08         226                   97.235%        220         96.774%      213
Orf09         120                   94.915%        114         93.220%      106
Orf09         165                   98.182%        162         96.364%      156
Orf10         104                   94.231%        98          93.269%      91
Orf10         76                    94.737%        72          93.421%      67
Orf 17        135                   94.074%        127         93.333%      119
Orf21         174                   99.422%        173         99.422%      172
Orf 24        174                   95.977%        167         94.253%      157
Orf 29b       140                   98.571%        138         98.571%      136
TS            352                   93.994%        331         93.093%      308

These results were then used to estimate the % similarity and identity across the viral
genome, as shown below in Table 4:



Table 4

Estimated % Similarity and Identity Across Entire Viral Genome

              similarity   identity
Mean          95.011%      93.716%
Std. Error    1.199%       1.532%
n             14           14
Avg. length   166.3        166.3
Stddev        70.5         70.5



Over the entire genome, the JMHV and the RRV17577 are estimated to have 95%
similarity. The present invention includes the entire JMHV genome, and each of the peptides
(including variants and fragments thereof), as well as the DNA sequences that encode them, such as
the ORFs set forth in this specification.
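
By way of illustration only, the summary statistics of Table 4 can be recomputed from the
per-region values of Table 3. The following is a minimal sketch, assuming that the standard
error is the sample standard deviation divided by the square root of n; the names SIMILARITY,
IDENTITY, LENGTHS, and summarize are hypothetical and do not appear in the specification. Because
the inputs are transcribed from the table as reproduced here, the recomputed means may differ
slightly from the values reported in Table 4.

```python
import math
import statistics

# Per-region values transcribed from Table 3, in table order (14 regions).
# These names are illustrative and are not part of the specification.
SIMILARITY = [94.203, 80.870, 98.230, 94.495, 97.235, 94.915, 98.182,
              94.231, 94.737, 94.074, 99.422, 95.977, 98.571, 93.994]
IDENTITY = [91.304, 75.652, 98.230, 94.495, 96.774, 93.220, 96.364,
            93.269, 93.421, 93.333, 99.422, 94.253, 98.571, 93.093]
LENGTHS = [207, 120, 226, 109, 226, 120, 165, 104, 76, 135, 174, 174, 140, 352]


def summarize(values):
    """Return n, mean, sample standard deviation, and standard error."""
    n = len(values)
    mean = statistics.mean(values)
    stdev = statistics.stdev(values)  # sample standard deviation (n - 1)
    # Assumption: standard error = sample standard deviation / sqrt(n).
    return {"n": n, "mean": mean, "stdev": stdev, "stderr": stdev / math.sqrt(n)}


if __name__ == "__main__":
    for label, values in (("similarity", SIMILARITY),
                          ("identity", IDENTITY),
                          ("length", LENGTHS)):
        stats = summarize(values)
        print(label, {key: round(val, 3) for key, val in stats.items()})
```

Under these assumptions, the alignment lengths sum to the 2328 residues stated above, and the
standard error of the similarity values agrees closely with the figure given in Table 4.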


EXAMPLE 31 
Reactivity of Human MS Patient Serum to 
Japanese Macaque 

To determine if there was evidence for JM herpesvirus or a closely related virus in MS
patients, serum antibodies that reacted with strain 17792 JM herpesvirus were measured. A panel of
15 coded human serum samples that contained serum from normal control and MS patients was
used. The results of this study are summarized in Table 5. Six of 15 samples yielded positive O.D.
values (> 0.150); 5 of the 6 positive samples were from MS patients. The 9 samples with O.D. values
of < 0.150 included 5 MS patients and 4 controls. However, when the O.D. values were ranked, 8 of
the 10 samples with the highest O.D. values segregated to MS patients, suggesting that some MS patients
have a humoral immune response to JM herpesvirus or a related human virus.

The present invention includes the antibodies that recognize JM herpesvirus, as well as
antigens recognized by these antibodies. Thus, viral antigens (such as viral antigens expressed in
humans) that are recognized by an anti-JMHV antibody are included herein. Examples of such
antigens include antigens encoded by a human virus that is associated with the development of MS. A
human viral antigen that is encoded by a human virus can have, for example, 80%, 90%, 95%, or
98% sequence homology with a JMHV antigen encoded by the virus deposited as ATCC accession
no. PTA-1884.

Variants of the JMHV include variants that infect humans, and which are associated with the
development of MS, or pathological lesions (such as demyelination) that are associated with MS in
humans. Such variants can be isolated using the techniques disclosed in this specification for
obtaining viruses with similar nucleic acid sequences, or the viruses can be recognized by the specific
binding of simian antibodies against JMHV to the human variant, as illustrated in Table 5.



Table 5 - MS and Control Patient Serum Reactivity
to Japanese Macaque Herpesvirus

Rank   ID No.   O.D. Value   Status
1      4        0.047        MS
2      11       0.052        Control
3      8        0.065        Control
4      1        0.082        Control
5      3        0.106        MS
6      2        0.112        Control
7      10       0.112        MS
8      7        0.120        MS
9      5        0.122        MS
10     12       0.169        MS
11     15       0.214        MS
12     14       0.253        MS
13     13       0.415        MS
14     9        0.419        Control
15     6        0.430        MS
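
By way of illustration only, the tallies reported in Example 31 can be reproduced from the rows
of Table 5. The following is a minimal sketch; the names TABLE_5, CUTOFF, split_by_cutoff,
status_counts, and top_n are hypothetical, and sample ID 8 is an assumed reading of an entry
that is illegible in the table as reproduced.

```python
from collections import Counter

# (rank, sample ID, O.D. value, status) rows transcribed from Table 5.
# Sample ID 8 (rank 3) is an assumed reading of an illegible entry.
TABLE_5 = [
    (1, 4, 0.047, "MS"), (2, 11, 0.052, "Control"), (3, 8, 0.065, "Control"),
    (4, 1, 0.082, "Control"), (5, 3, 0.106, "MS"), (6, 2, 0.112, "Control"),
    (7, 10, 0.112, "MS"), (8, 7, 0.120, "MS"), (9, 5, 0.122, "MS"),
    (10, 12, 0.169, "MS"), (11, 15, 0.214, "MS"), (12, 14, 0.253, "MS"),
    (13, 13, 0.415, "MS"), (14, 9, 0.419, "Control"), (15, 6, 0.430, "MS"),
]

CUTOFF = 0.150  # O.D. threshold for a positive sample, as stated in Example 31


def split_by_cutoff(rows, cutoff=CUTOFF):
    """Split rows into (positive, negative) lists by O.D. value."""
    positive = [row for row in rows if row[2] > cutoff]
    negative = [row for row in rows if row[2] <= cutoff]
    return positive, negative


def status_counts(rows):
    """Count rows per patient status (MS or Control)."""
    return Counter(status for _, _, _, status in rows)


def top_n(rows, n):
    """Return the n rows with the highest O.D. values."""
    return sorted(rows, key=lambda row: row[2], reverse=True)[:n]


if __name__ == "__main__":
    positive, negative = split_by_cutoff(TABLE_5)
    print("positive:", dict(status_counts(positive)))
    print("negative:", dict(status_counts(negative)))
    print("top 10:", dict(status_counts(top_n(TABLE_5, 10))))
```

The two samples tied at an O.D. value of 0.112 both fall within the ten highest values, so the
tie does not affect the count of MS patients among those ten samples.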



Having illustrated and described the principles of cloning the RRV and JMHV nucleic acid
molecules, cDNA, proteins encoded by the cDNA, and modes of use of these biological molecules, it
should be apparent to one skilled in the art that the invention can be modified in arrangement and
detail without departing from such principles. In view of the many possible embodiments to which
the principles of our invention may be applied, it should be recognized that the illustrated
embodiments are only examples of the invention and should not be taken as a limitation on the scope
of the invention. Rather, the scope of the invention is in accord with the following claims. We
therefore claim as our invention all that comes within the scope and spirit of these claims.

We claim:

1. An isolated virus (Japanese Macaque Virus, JMHV) as deposited with ATCC as deposit
accession number PTA-1884.

2. A host cell infected with the virus of claim 1.

3. A purified polypeptide comprising an amino acid sequence that has at least 95%
sequence identity to an amino acid sequence as set forth as SEQ ID NO:3, SEQ ID NO:5, SEQ ID
NO:8, SEQ ID NO:10, SEQ ID NO:12, SEQ ID NO:14, SEQ ID NO:17, SEQ ID NO:19, SEQ ID
NO:21, SEQ ID NO:23, SEQ ID NO:25, and SEQ ID NO:27.

4. A purified polypeptide of claim 3, wherein the polypeptide has at least 98% sequence
identity to an amino acid sequence as set forth as SEQ ID NO:3, SEQ ID NO:5, SEQ ID NO:8, SEQ
ID NO:10, SEQ ID NO:12, SEQ ID NO:14, SEQ ID NO:17, SEQ ID NO:19, SEQ ID NO:21, SEQ
ID NO:23, SEQ ID NO:25, and SEQ ID NO:27.

5. The purified polypeptide of claim 2, wherein the polypeptide has an amino acid sequence
as set forth as SEQ ID NO:3, SEQ ID NO:5, SEQ ID NO:8, SEQ ID NO:10, SEQ ID NO:12, SEQ ID
NO:14, SEQ ID NO:17, SEQ ID NO:19, SEQ ID NO:21, SEQ ID NO:23, SEQ ID NO:25, and SEQ
ID NO:27.

6. An antibody that binds to the purified polypeptide of claim 2.

7. An isolated nucleic acid sequence encoding the polypeptide of claim 3.



8. An isolated nucleic acid sequence comprising a sequence having at least 95% sequence
identity to one of SEQ ID NO: 2, SEQ ID NO: 4, SEQ ID NO: 6, SEQ ID NO: 7, SEQ ID NO: 9,
SEQ ID NO: 11, SEQ ID NO: 13, SEQ ID NO: 15, SEQ ID NO: 16, SEQ ID NO: 18, SEQ ID NO:
20, SEQ ID NO: 22, SEQ ID NO: 24, or SEQ ID NO: 26.

9. The isolated nucleic acid sequence of claim 8, wherein the nucleic acid comprises a
sequence as set forth as SEQ ID NO: 2, SEQ ID NO: 4, SEQ ID NO: 6, SEQ ID NO: 7, SEQ ID NO:
9, SEQ ID NO: 11, SEQ ID NO: 13, SEQ ID NO: 15, SEQ ID NO: 16, SEQ ID NO: 18, SEQ ID NO:
20, SEQ ID NO: 22, SEQ ID NO: 24, or SEQ ID NO: 26.

10. An isolated nucleic acid sequence comprising a promoter operably linked to the nucleic
acid sequence of claim 7.



11. A vector comprising the nucleic acid sequence of claim 7.

12. A host cell transformed with the vector of claim 11.

13. The vector of claim 11, wherein the vector is a viral vector. 


14. A viral particle comprising the viral vector of claim 12. 

15. An isolated nucleic acid molecule that hybridizes under highly stringent conditions with 
a nucleic acid molecule encoding the polypeptide of claim 5. 

16. A method for testing the efficacy of a drug in the treatment of a condition associated 
with infection with Japanese macaque herpesvirus (JMHV), the method comprising: 

(a) administering the drug to a non-human primate infected with a 
Japanese macaque herpesvirus (JMHV); and 

(b) observing the non-human primate to determine if the drug prevents or 
reduces the presentation of one or more symptoms associated with 
Japanese macaque herpesvirus (JMHV) infection. 

17. The method of claim 16, wherein the drug is a drug used to treat multiple sclerosis.

18. The method of claim 16, wherein the non-human primate is a Japanese macaque
monkey.

19. A method for testing the efficacy of a candidate vaccine against Japanese macaque 
herpesvirus (JMHV) infection, or conditions associated with Japanese macaque herpesvirus 
(JMHV) infection, the method comprising: 

(a) administering the vaccine to a non-human primate susceptible to 
infection with the Japanese macaque herpesvirus (JMHV); 

(b) inoculating the subject with the Japanese macaque herpesvirus
(JMHV), and

(c) observing the non-human primate to determine if the vaccine prevents
or reduces an incidence of Japanese macaque herpesvirus (JMHV) infection or a symptom
associated with Japanese macaque herpesvirus (JMHV) infection.

20. The method of claim 19, wherein the non-human primate is a Japanese macaque
monkey.

21. The method of claim 19, wherein the symptom is associated with multiple sclerosis.

22. A method of detecting the presence of Japanese macaque herpesvirus (JMHV) or a
related virus in a biological specimen, comprising:

(1) amplifying by polymerase chain reaction a Japanese macaque herpesvirus
(JMHV) nucleic acid sequence, if such sequence is present in the sample, using two
or more oligonucleotide primers comprising 20 contiguous nucleotides of the
nucleic acid sequence of claim 7; and

(2) determining whether an amplified sequence is present.

23. The method of claim 22, wherein the step of determining whether an amplified
sequence is present comprises one or more of:

(a) electrophoresis and staining of the amplified sequence; or 

(b) hybridization to a labeled probe of the amplified sequence. 

24. The method of claim 23, wherein the amplified sequence is detected by hybridization to 
a labeled probe. 

25. The method of claim 24, wherein the probe comprises a detectable non-isotopic label 
chosen from the group consisting of: 

a fluorescent molecule; 

an enzyme; 
a co-factor; 

an enzyme substrate; and 
a hapten. 

26. The method of claim 23, wherein the biological specimen is a primate specimen. 

27. The method of claim 26, wherein the primate specimen is a non-human primate 
specimen. 

28. A method of detecting the presence of Japanese macaque herpesvirus (JMHV) in a
biological specimen, comprising:

exposing the biological specimen to a probe that hybridizes to a Japanese macaque
herpesvirus (JMHV) nucleic acid sequence of claim 7, if the sequence is present in
the sample to form a hybridization complex; and
determining whether the hybridization complex is present.

29. The method of claim 28, wherein the biological specimen is a primate specimen.

30. The method of claim 29, wherein the primate specimen is a non-human primate
specimen.

31. A method of obtaining a Japanese macaque herpesvirus (JMHV)-related nucleic acid
sequence comprising:

(a) amplifying a rhesus rhadinovirus (RRV) nucleic acid sequence using two or
more oligonucleotide primers comprising 20 contiguous nucleotides of the nucleic
acid sequence of claim 7 to generate a product amplified nucleic acid sequence; and

(b) detecting the presence of a product amplified nucleic acid sequence; and

(c) isolating the product amplified nucleic acid sequence.

32. The method of claim 31, wherein amplifying the rhesus rhadinovirus (RRV) nucleic
acid sequence comprises polymerase chain reaction amplification.



33. The method of claim 31, wherein the detection of an amplified sequence
comprises detecting the amplified sequence using one or more of:

(a) electrophoresis and staining of the amplified nucleic acid sequence;
(b) hybridization of a labeled probe to the amplified nucleic acid sequence; or

(c) sequencing the amplified nucleic acid sequence.



34. The method of claim 33, wherein the amplified sequence is detected by hybridization. 

35. A method of obtaining a Japanese macaque herpesvirus (JMHV)-related nucleic acid
sequence from a sample comprising:

(a) contacting a nucleic acid of the sample with a probe or primer that hybridizes
under moderately stringent hybridization conditions to the nucleic acid of the
sample, wherein the probe or primer comprises 15 consecutive nucleotides of the
nucleic acid sequence of claim 7; and

(b) isolating the nucleic acid of the sample to which the probe hybridizes.

36. The method of claim 35, further comprising sequencing the nucleic acid of the sample
to which the probe hybridizes.



WO 01/88203 



PCT/US01/16274 



-81- 

37. A method of detecting the presence of Japanese macaque herpesvirus (JMHV) in a
biological specimen, comprising:

contacting the biological specimen with the antibody of claim 6,
detecting binding of the antibody to the biological specimen or a component
thereof, wherein binding of the antibody to the biological specimen indicates the
presence of a Japanese macaque herpesvirus (JMHV).

38. The method of claim 37, wherein the antibody is detectably labeled. 

39. The method of claim 37, wherein the probe comprises a detectable non-isotopic label
chosen from the group consisting of:

a fluorescent molecule;

a chemiluminescent molecule;

an enzyme;

a co-factor;

an enzyme substrate; and
a hapten.

40. A kit comprising a container means comprising an oligonucleotide primer comprising at
least 15 contiguous nucleotides of the nucleic acid sequence of claim 8.

41. A kit comprising a container means comprising an antibody of claim 6.

42. A virus comprising a nucleic acid sequence that has 95% sequence identity with a
nucleic acid sequence of the virus of claim 1.

43. The virus of claim 42, wherein infection with virus in a subject results in demyelinating
encephalomyelitis in the subject.

44. The virus of claim 42, wherein the virus is a human virus.

45. A nucleic acid sequence obtained by the method of claim 31 or claim 35.



46. The nucleic acid sequence of claim 45, wherein the sequence is a viral sequence.

47. The nucleic acid sequence of claim 46, wherein the viral sequence infects human cells.

48. A non-human primate model for multiple sclerosis, comprising a non-human primate
infected with a Japanese macaque herpesvirus (JMHV), wherein the non-human primate
exhibits a symptom or a pathological feature of multiple sclerosis.

49. The non-human primate model of claim 48, wherein the symptom is acute onset paresis
or paralysis involving one or more limbs.

50. The non-human primate model of claim 48, wherein the pathological feature is myelin
destruction in a central nervous system.

51. The non-human primate model of claim 48, wherein the non-human primate is a
Japanese macaque.

FIG. 1

Phylogenetic Comparison of the Gammaherpesviruses

[Phylogenetic tree; branch labels: EBV, AHV, MHV, HVS, KSHV, RRV]

EBV - Epstein-Barr virus

AHV - Alcelaphine herpesvirus

MHV - Murine herpesvirus 68

HVS - Herpesvirus saimiri

KSHV - Kaposi's sarcoma-associated herpesvirus

RRV - Rhesus rhadinovirus 17577

FIG. 2

Restriction Fragments of the RRV 17577 Genome

BamHI                     EcoRI                     HindIII
fragment   fragment       fragment   fragment       fragment   fragment
number     size (bp)      number     size (bp)      number     size (bp)
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17189 
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12476 


1* 


22006 


2 


15598 
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10342 


2 


17108 


3 


15441 


3 


9565 


3 


16542 
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12360 


4 


9213 


4 


14134 


5* 


8943 
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8465 
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11516 
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7747 
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10743 


7 
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8452 


8 
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9 


7278 


9 
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10 
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10 


7002 


10 


3374 


11 
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11* 


5400 


11 
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12 
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12* 


5054 


12 


2891 


13 


3411 


13 


4907 


13 


2849 


14* 
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14 


4771 


14* 


2832 


15 


3008 


15 
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15 


. 1599 


16 


2916 


16 


4099 


16 


1272 


17 


2210 


17 


3516 


17 


1016 


18 


1343 


18 


2102 


18 


853 






19 


1868 


19 


811 






20 


1603 










21 


1512 










22 


1221 










23 


910 










24 


624 










25 


609 










26 


592 










27 


584 










28 


122 










29 


107 







* Indicates that the fragment size excludes terminal repeat sequences 
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FIG. 3 
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MacVector Output for long unique region of rhesus 
rhadinovirus 17577 



LOCUS       UQNS tKEIQa   131634 BP   DS-DNA   UPDATED 06/26/98
DEFINITION
ACCESSION
KEYWORDS
SOURCE
FEATURES    From    To/Span           Description
pept        513     1784      1       R1
pept        2418    1852      (C)     Similar to HHV8 Orf    dihydrofolate reductase
pept        2836    4773      1       Similar to HHV8 Orf    complement binding protein
pept        5205    8603      1       Similar to HHV8 Orf 6 - ssDNA binding protein
pept        8628    10688     1       Similar to HHV8 Orf 7 - transport protein
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. 1 Similar to HHV8 Orf 
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- glycoprotein B 
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17575 


1 Similar to HHV8 Orf


10 
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(CI R2 viral Hr-6 








pept 
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19938 
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70 
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frag 
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repeat sequence 
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repeat sequence 
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16 
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FIG. 5 



Comparison of Corresponding Repeats in
RRV and KSHV

virus   insert name   total length       repeat unit length   G + C content
KSHV    fmk (1)       332 bp, 292 bp     20 bp, 30 bp         80.1%, 84.9%
RRV     syko (1)      304 bp, 1008 bp    26 bp, 25 bp         53.3%, 79.9%
KSHV    zppa (1)      308 bp, 244 bp     23 bp, 23 bp         74.0%, 77.9%
RRV     vrtgo (1)     405 bp, 1029 bp    19 bp, 32 bp         74.6%, 84.4%

virus   insert name   total length       repeat unit length   G + C content
KSHV    mdsk          409 bp             - (2)                75.4%
RRV     brds          196 bp             13 bp                81.6%

(1) KSHV fmk and zppa and RRV syko and vrtgo are tandem repeats.

(2) KSHV mdsk is a complex repeat with no defined unit length.
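
By way of illustration only, a G + C content of the kind tabulated in FIG. 5 can be computed
from a nucleotide sequence as the percentage of positions that are G or C. The following is a
minimal sketch; the function name gc_content and the example sequence are hypothetical, and no
repeat sequence from FIG. 5 is reproduced here.

```python
def gc_content(sequence):
    """Return the percentage of G and C bases in a nucleotide sequence.

    Whitespace is ignored and the comparison is case-insensitive.
    """
    bases = "".join(sequence.split()).lower()
    if not bases:
        raise ValueError("sequence is empty")
    gc = sum(1 for base in bases if base in "gc")
    return 100.0 * gc / len(bases)


if __name__ == "__main__":
    # Hypothetical 10-base example, not taken from the specification.
    example = "ggcgc atgcc"
    print(f"{gc_content(example):.1f}% G + C")
```

The same function can be applied to a whole repeat region or to a single repeat unit, depending
on which of the two the tabulated percentage is taken to describe.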
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FIG. 10 

atg ttc cct gtc tgg ttc gtc ttg ttt tac ctg tcg tgt tgg gcg gcc 48
Met Phe Pro Val Trp Phe Val Leu Phe Tyr Leu Ser Cys Trp Ala Ala
1 5 10 15

agc cct acg ctg gcg cct ccc ccg act gcc gct gga att aac gtt ctc 96
Ser Pro Thr Leu Ala Pro Pro Pro Thr Ala Ala Gly Ile Asn Val Leu
20 25 30

ccc cag tgg gcc ggc aac cgc gcc tct ctt gac agg acc agg ggg cgc 144
Pro Gln Trp Ala Gly Asn Arg Ala Ser Leu Asp Arg Thr Arg Gly Arg
35 40 45

ctg tct gaa gtg ggg tta aac ata cag cgc tgg ttc gtt tac ctg tgc 192
Leu Ser Glu Val Gly Leu Asn Ile Gln Arg Trp Phe Val Tyr Leu Cys
50 55 60

cac cac tcc act ctc tgt cgg gtg cgt gag tac ccg cgc atc atg tcg 240
His His Ser Thr Leu Cys Arg Val Arg Glu Tyr Pro Arg Ile Met Ser
65 70 75 80

ttt gtt cac ttc cct ata ttg atg tct aac gtt gag tgc cag cgc cgc 288
Phe Val His Phe Pro Ile Leu Met Ser Asn Val Glu Cys Gln Arg Arg
85 90 95

gag ttt cgc ggg gcc gag tgt atg aac gcc atg gtt cgc ggg ctc cgg 336
Glu Phe Arg Gly Ala Glu Cys Met Asn Ala Met Val Arg Gly Leu Arg
100 105 110

gcc tac gag agt tac ctg acg cga ctg agg atg ctg ctg gac gac gcg 384
Ala Tyr Glu Ser Tyr Leu Thr Arg Leu Arg Met Leu Leu Asp Asp Ala
115 120 125

ccc ggg gac gcg gac gcc gcg gcc att ggc tcc gcg gtg acc gtg gtg 432
Pro Gly Asp Ala Asp Ala Ala Ala Ile Gly Ser Ala Val Thr Val Val
130 135 140

ctg tcc gcc ctc gac tct cta att gag gag ctt ccc gta aat aac aag 480
Leu Ser Ala Leu Asp Ser Leu Ile Glu Glu Leu Pro Val Asn Asn Lys
145 150 155 160

ata ggt ggc gcg gag tct aat gaa aaa acc gtg cgt gcg ttg gga ggg 528
Ile Gly Gly Ala Glu Ser Asn Glu Lys Thr Val Arg Ala Leu Gly Gly
165 170 175

cag agc ccc cgg gac gtt gtt ctc agc gcg ttt cgc ata ctg gaa tat 576
Gln Ser Pro Arg Asp Val Val Leu Ser Ala Phe Arg Ile Leu Glu Tyr
180 185 190

cta cag atg ttt ttg cgg gac ggg cgc cgc gca ata gct atg atg taa 624
Leu Gln Met Phe Leu Arg Asp Gly Arg Arg Ala Ile Ala Met Met
195 200 205
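
By way of illustration only, the correspondence between the codons and the amino acids shown in
FIG. 10 can be checked by translating the nucleotide sequence with the standard genetic code.
The following is a minimal sketch; the names CODON_TABLE and translate are hypothetical, and the
two example rows are the first and last rows of FIG. 10 as read here, with characters that are
illegible in the reproduced figure resolved on the assumption that each codon must encode the
amino acid printed beneath it.

```python
from itertools import product

# Standard genetic code, with codons enumerated in t, c, a, g order.
BASES = "tcag"
AMINO_ACIDS = ("FFLLSSSSYY**CC*W"
               "LLLLPPPPHHQQRRRR"
               "IIIMTTTTNNKKSSRR"
               "VVVVAAAADDEEGGGG")
CODON_TABLE = {"".join(codon): aa
               for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)}


def translate(sequence):
    """Translate a nucleotide sequence into one-letter amino acid codes.

    Whitespace is ignored. Translation stops at the first stop codon, and
    any trailing bases that do not form a complete codon are ignored.
    """
    bases = "".join(sequence.split()).lower()
    protein = []
    for i in range(0, len(bases) - len(bases) % 3, 3):
        amino_acid = CODON_TABLE[bases[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


if __name__ == "__main__":
    # First and last rows of FIG. 10 (assumed readings, see the note above).
    first_row = "atg ttc cct gtc tgg ttc gtc ttg ttt tac ctg tcg tgt tgg gcg gcc"
    last_row = "cta cag atg ttt ttg cgg gac ggg cgc cgc gca ata gct atg atg taa"
    print(translate(first_row))
    print(translate(last_row))
```

A codon that does not translate to the amino acid printed beneath it indicates a misread
character in either the nucleotide line or the amino acid line.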
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FIG. 11 

atg agg ggc ctt ttc gtg tgc gtt ttt ttt gcg gtg ttc gcg tgt gta 48
Met Arg Gly Leu Phe Val Cys Val Phe Phe Ala Val Phe Ala Cys Val
1 5 10 15

gtt gat tat gcc ttt cct atg ggc tcc atg agc gga ccc gcg ccc gaa 96
Val Asp Tyr Ala Phe Pro Met Gly Ser Met Ser Gly Pro Ala Pro Glu
20 25 30

ctc tgc tgt ttg ggg tat gta act cat ctg ccg cca ccc ggt tta gtg 144
Leu Cys Cys Leu Gly Tyr Val Thr His Leu Pro Pro Pro Gly Leu Val
35 40 45

gtc tct tac tcc cac acc tcg tcg cag tgc tcg gtg gac gcc gtg ata 192
Val Ser Tyr Ser His Thr Ser Ser Gln Cys Ser Val Asp Ala Val Ile
50 55 60

tta aac act cgc cgc ggt aaa aag ctg tgt gcc aat ccc ggg gac gac 240
Leu Asn Thr Arg Arg Gly Lys Lys Leu Cys Ala Asn Pro Gly Asp Asp
65 70 75 80

gca gtg aag aaa ctg ctt cag gcg gtg gac aag cgt ccc aaa aag ggc 288
Ala Val Lys Lys Leu Leu Gln Ala Val Asp Lys Arg Pro Lys Lys Gly
85 90 95

aga aga acc cgg cgc agc ctg att gac gat tcc gaa gag ggc ctt ggc 336
Arg Arg Thr Arg Arg Ser Leu Ile Asp Asp Ser Glu Glu Gly Leu Gly
100 105 110

agc ggg att tag 348
Ser Gly Ile
115
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ttttacgtaa aacatattgt aatttaatca cgtgatgttt atgttggcta ctaacacaca 3540 

atgtttgcat aaaagcactt atgttagtga gcttcaatct ttatatttta ttcaaattgt 3600 

ttaaagaggc atgtgaatat atttaacacg cattcagaaa taggcgcaac gctgtgccgt 3660 

ataactggta aaaacatgac gtttaaactt tttcctctgt ttttattaca cgccataatg 3720 

tacgtccact gcgatgaaaa ctgtaaacct ccacatttca cggaatatcg cgtcaagtct 3780 

aacacagaaa aggacttata tagtgttgga gaaacagctg aattaatttg tcgtcctggt 3840 

tatgttacaa atacaaaaat aataacaaca gaatgtttac aaaatggtac gtggtcaaca 3900 

ccaaattttc catgcgacag aaaaagatgt cccacacctg ctgacttgct gaatggagcc 3960 

gtgcacattc acgggggaga taatgcctta aaatttggat ccaatatttc ctatgagtgt 4020 

aatgaaggtt atgatttaat tggtagtaat gttcgttttt gtattttaca agacacagaa 4080 

aacgtaaatt gggattcaaa tgaaccagtc tgtgaaattc agaaatgtat taaaccaccg 4140 

gcagtggaac atggggacta cctacctaac caagatgttt ataactatgg agatgcaatt 4200 

acatttaaat gttcattgtc gtatacactc gttggatcaa caacattagt atgcacgtca 4260 

aacaaaaagt ggtcaaactc tttcccaacg tgtttaatgc tcgtatgtga aagtccacaa 4320 

atagacaatg ggtacataga cattggcttg tccagaagat acaaccatgg acaatcaatt 4380 

actgtaaagt gtagcgacgg gtacaacatt gttgggcctg aaacattaac gtgcacaaac 4440 

acaacttggg ttccaccatt acctaagtgt gtattagtta caaataaccc aagcacaccc 4500 

atgccagaaa cacccatgcc agaaacaccc acgccagatt atcaaaaaat aaatttgtca 4560 

accgctaaaa ctgcaacaac accaaatgcg tttgttacaa ctgttgtttc tccagagaaa 4620 

gacgacgtta cttgtgtaaa gcctcatttt gagcgattca tggtaaaggc tgaaaatgac 4680 

aaggaaaaat acagtgttgg tgcgagcgtt gagctaatat gtcgaccagg atttactaaa 4740 

atgcagtcta cagtttctgt tgaatgtttg tccaacggaa catggactgc tccaaatgcc 4800 

aagtgtcata gaaaaaaatg tccaacccct caagaacttt taaacggaga gtatatagtt 4860 

acaagcggag aagatgcttt taagtacgga acaaatataa catataaatg taatgaaggt 4920 

tatcaacttt taggaagtat ggtgcggatt tgtatgctta aagacgattt aaaaacagtt 4980 

gactgggagc caaaagcgcc tatatgtgat attgaaaaat gtaagccacc gccacaaatt 5040 

acaaacggaa aataccatcc ggtgaaagac ttttatcagt atttggacac cgtaacattt 5100 

tcgtgcaatc gtgacttttc tttagttgga gatgaaatga caacgtgtat aagtaatacg 5160 

tggaataaac cgtttccaag atgtgaacaa atcacttgca gcgctcctaa tattgcacac 5220 

ggaaagctgc taacaggttc ttcaagcgtt tacaaatacg gtcaatctgt taccattggt 5280 

tgtgaaactg gatttactct aattggcagt gaaatttcta catgcaagga ttcatcgtgg 5340 

gatccaccac ttcctacgtg cgtgccagct gtttcaatgc cttctgacac acctaaacca 5400 

gaaaccaaaa aaccaaacac gccaacgcca gaagcaccca aaccaaacac cccaaacgtt 5460 

ggaacacata caccattcaa accaccacca caaaatccac caatagcacc cccaatgagt 5520 

aaatggaaaa ggcatgtcgt gttagttctt tttgcaagtg tcgcgtcctt gttattcgta 5580 
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7080 


tgctgtacct aactaaagtg cagaagcaag cgcccagggg acaaaaggga gacgtaaacg 7140

tgtacaactc ctttttcctc caacacggac tggggttttt aaatgaggcc acgatcaagg 7200

aaaacggcag cgaagccttt aagggcgtac cctcaaacgc cctcgatggt tcttcgttca 7260

cgccgtatca cctggcctac gccgcgtctt tctcgcccca tctgctggcg aagttatgtt 7320

attacatgca gttcttgcaa caccacaaaa gctccacgaa ccaggcgttt aacatggtcc 7380

attatgtcgg caccgccgcc aactcagaga tgtgcacgct atgtcacggc aacacgccgg 7440

caacgtgcct caacacgctg ttctatagac tgaaggatag gtttcccgcc gtaaccaccc 7500

ctcagcgcag ggacccctac gtggtgaccg gaacagccgg gacctttaac gacctggaga 7560

ttctgggcaa cttcgcgagc tttagagacc gcgaagagga cggaaacccg gccgacgagc 7620

acccaaagta cacgtactgg cagctatgtc agaccgtgac agaaaagcta tccgcgattg 7680
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gaatcaccga agaccacgat aatcacgtga acctcatcac caacatccaa agttttctca 7740 

gggtgttcaa gggtatcgac tcaattgtgg acggagaggt catgaagttc gttaattcga 7800 

tgattaaaaa taactttaat ttccgcgagc acgtcaaatc ggtccatcac atactccagt 7860 

tctgctgcaa cgtgtattgg caggcgccgt gcgcggtgtt cctgaatctg tactacaaat 7920 

ccctgctgtg gatcattcag gatatctgcc tgccgtactg catgatctac gaacaagata 7980

atccggcgat gggcatcctc ccctccgagt ggctaaagat gcattttcag acgttgtgga 8040 

cgaactttaa agcggcgtgt ctcgaccgcg gggtcctcac ggggtgcgaa ctgaaaatcg 8100 

tacaccggga catgttctgc gacttcttcg acaccgacgc ggggtccaac ggcttaatgg 8160 

ccccctttaa aatgcaggtt agaatagccc gagccatgat ggtcgttccg aaatcaatta 8220 

aaataaaaaa tagaatcatt ttttccaaca ccgcgggatc cgaggcggtg cagtcggggt 8280 

tcgtcaaacc gacgggaacc agggacactt acgtggtggc cggaccgtac atgaagtttc 8340 

tcaactcgct gcatcgcgcg ctgttccccg acaccaagac cgccgcgctg tacctgtggc 8400 

acaagatctc ccagaccaac aaaaccccag ttctgaaaga cgtcccggac gacgagctgg 8460 

cggagctggt gtcgtacgta aagaccaaca gcctcgcgtt cgaggaaacg aacgtgctgg 8520 

acgtggttcc ggattcactc atgtcgtacg cgaggatcaa actgaacggg gccattctaa 8580 

gggcatgtgg ccagattcag ttctacgcca cgacgctgca ctgcctcacg ccggtgctac 8640 

agacgatcga tgccgaggaa tacccccacg tgctgggctc cgcggcaatc gccacaccgg 8700 

tggcttacct ggcagaaata cgcggccgca ccgccctcac cgtccagacg acggcgcgtc 8760 

agccggtcgc cgccacaggg cgcctgcgtc ccgtgataac cgttcccatg gtagtcaaca 8820 

aatacacggg ggtcaacggg aacaacaacg ttttccactg cggaaacctg gggtacttcg 8880 

cggggcgcgg cgtggaccgc aacctgtggc cggaaagctc cccctttaag aaaacgggcg 8940 

tcagcgccat gctaagaaag agacacgtca tgatgacccc cattatcgac cgcctaataa 9000 

agcgagccgc gggacagaca atcagcacgt tcgaggcgga aagcgttaaa aggagcgtgc 9060 

aggcgctgtt agaggataag gacaacccta acctattgaa gtcggtaatc ttggagctta 9120 

tacgacacct ggggaagggc tgccaggact taagctccga ggacgtgcaa tattacctcg 9180 

gtgactattg tatgttgacg gacgaggttt tatttacgtt ggataatata gcacagtcag 9240 

gcgtgccgtg gactatcgag gacgcgggtg ccctaataga ggatcgccag gacgcagacg 9300 

atcttcagtt cgtagacagc gacgatatcg ccaccgcttc ctgtcagccc cccgaggaac 9360 

agctaccgac ccctagcgcc ggcgccctac tggccgggaa gaagcgaaaa attaacgcgc 9420 

tgctgagcga tctagacctt taggaaaacc gtggcaggcg ggcaacaatg gccagggaac 9480 

tcgcagcatt atacgcgcag ctgtcggccc tcgccgtcga cttgagtctg gttatctttg 9540 

cggacccgcg aagtatcgac ggtgcccgca ttctaaaaac aaaaacacag atagagaacc 9600 

tgaaccgcga ccttctgccg ctgctacgcg agcaaaactc ggtagagacg tccagcctgt 9660 

cgctcgaagt ggagcacctg gccaaaaaca tcgaggacaa actcggcgag ctggagcgca 9720 

gtctgcggca gagatattcg agccgagagc attttgaaac actacacctg agacccgaat 9780 
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gtcactatca ctctacggtt 


acttttcagt 


tttacggggg 


cgggttaata 


gatgtaaaca 


9840 


tgtgcctaat aaacgatgta 


gaactgctgt 


gtaaaagact 


agggagtgtg 


ttttattgca 


9900 


tcggtgcaaa cgaagctctg 


tccggattga 


accgggttct 


gacgtttctg 


tcaacactgc 


9960 


ggggtatctc cccgatcccg 


cacccagacc 


tatacgtcac 


gtcagtgcct 


tgcgtacagt 


10020 


gcctgaggga aatcgaactc 


gtaccaaatc 


aggggtccag 


tttactcgcg 


gtgttggcag 


10080 


accgacactg cgatcacctc 


tgtaagaagg 


ttagggcgga 


gccaatacac 


ggcctgtttg 


10140 


agacagaact gagccagctg 


ggtctaaaag 


taacaaaacg 


ttcggacgcc 


acgcagcacg 


10200 


gcgtccggtc ctctgcagat 


cagttaaggg 


agtcgtcgct 


ggcggccata 


caagatcaca 


10260 


atatattcaa acgggtgtcc 


gcgtcaat;ca 


tggaactatc 


caatctaatt 


tattggaacg 


10320 


ccgggcaaac cggcctccag 


accgggaccg 


aaaacgagtg 


ctcacaaatg 


gccagactgc 


10380 


taacacacga ggccgatatg 


cacgagcacc 


gtgcgctaat 


aacacccaaa 


ctaagcgcga 


10440 


ctcacttcta cgactgtttc 


cgaccggatc 


ccatagaatc 


cctgttctgc 


ggcggtcttt 


10500 


ttaactctat agacgacacc 


ataaacgcac 


tgagccggga 


ttgctccgtg 


acgttctttc 


10560 


aacaggcaaa ctataccaac 


gttatgcgaa 


aacaaaacga 


gctgttcacc 


agactcaata 


10620 


gcatcctgcg tcaggggagc 


gcgggatcgc 


aaaaaccggc 


caccccctcg 


gagccacgga 


10680 


ccaccaccgt ggcggcaacc 


gcggcaagcg 


acgtcatcaa 


agacgcacag 


tatcgcaaag 


10740 


aacagtacat gaaaaaggtg 


gccagggacg 


gctttaaaaa 


actaacagag 


tgtctgcaga 


10800 


cgcagagcgc ggtgttggca 


aacgcactct 


gcatgcgcgt 


atgggggggc 


gtcgcctacg 


10860 


gcgaggcgtc cgagctggtg 


aaccattttc 


tcctcaggcg 


gcgcttcgtc 


gcgcttccct 


10920 


gggaggcgcg ctgccgctcg 


gatcagattt 


tattcgaaaa 


ctcaaagtac 


attaaaaact 


10980 


cactatattc ccagcgcctc 


agtcgcgaac 


acgtagagat 


tatcacgctg 


cagttttacg 


11040 


gcctgataac cggccccctg 


acgcgccaga 


gcgatctctt 


tcccggcccc 


gccaacgtcg 


11100 


cgctggccca gtgtttcgag 


gcggccggaa 


tgcttccgca 


tcacaagatg 


ctggtgtcag 


11160 


agatgatatg gccccagatt 


caaccgaaag 


actggataga 


ccagacattt 


aatcgttttt 


11220 


accaacttcc cgaaggtgat 


ctcaacgcgg 


tacaaaagtc 


cgcctggtgc 


tttatacgag 


11280 


agctcgtcct ctcggtggcg 


ctttataatc 


gcacgtggga 


aaagacgctg 


cggatatttt 


11340 


ccctagcgcg cgagaaactc 


tccatctcca 


acctagacgt 


taaaggcctg 


acgtccggcc 


11400 


tgtatctaac gtacgagcaa 


gacgcgccgc 


tcgttctaat 


ttctcaaaat 


accggctgga 


11460 


tatttaaaga cctgtacgct 


cttctgtacc 


atcacctgca 


actgtccgac 


ggccatgatg 


11520 


ataactaacc gaacgcgtcg 


tctcctgcgg 


gcgtgggtcg 


tgataatcgc 


gatcggcacg 


11580 


gcggttggcg aaaacgtcac 


cacccccaag 


ggcgcgacca 


ccaccgcgaa 


gccaacgccg 


11640 


ggcccgtcga cgcccacacc 


tcccgagaac 


ccacctaggg 


ccgaggcgtt 


taagtttcgc 


11700 


gtgtgcagcg cctcggccac 


cggcgaactc 


ttcaggttta 


acctggaaaa 


aacgtgtccg 


11760 


ggcaccgagg acaagacgca 


ccaagaaggc 


atcctgatgg 


tgtttaaaaa 


aaatattgtc 


11820 


ccgcacatct ttaaggtcag 


acggtaccgc 


aaggtggcca 


cctcggtgac 


cgtctatcga . 


11880 
gggtggaccg agaccgccgt gaccggcaag caagaggtca tccgaccggt gccgcagtac 11940

gagatcaacc acatggacac gacctaccag tgtttcagct ccatgcgcgt aaacgtcaac 12000

ggcatagtaa acacctacac ggacagggac ttcactaacc agaccgtgtt tctgcaaccg 12060

gtcgaggggc tcacggataa catccagcga tacttcagtc agccggtgct gtacacgaca 12120

ccgggatggt ttccgggaat ttacagggtc agaaccacgg tcaactgcga gatcgtggac 12180

atgatcgcgc gttcggcgga accgtactcg tattttgtca ccgccctggg agacacggta 12240

gaggtgtccc cgttctgcca caacgactca acgtgctcgg tcgcggagaa aaccgaaaac 12300

ggcctcggcg cccgcgtgct cacaaattac accatcgtcg acttcgcgac ccgccagccc 12360

accaccgaaa cgcgggtctt cgccgactcg ggagaataca ccgtatcgtg gaaggcggag 12420

gaccccaagt cggcggtctg cgcgctgacg ctctggaaaa ccttccccag ggcgatacag 12480

acgacgcacg aggccagcta ccacttcgtg gccaacgacg tgacggcgac cttcacgtcc 12540

ccgctctccc aggtaactaa cttcacgggc acgtacccct gcctcaatga tgttattcag 12600

aaaaccctca acgccaccat caagaagctg tccgataccc acgcaacaaa cggatcggag 12660

cagtactacg aaaccgaggg gggtctgttt ctcctgtggc agccgttaac gccgctaagc 12720

ctagctgacg agatgcgcga attaaacggc accacgccag caccccccac cacaacctca 12780

accgccaacc gcgttcgaag aagcgtcggt acgaacgagc aggcaacgga cgacctagcg 12840

gcgccccagc tgcagttcgc ctacgacaag ctccgcgcga gcatcaacaa ggtgctggag 12900

gagctctcca gggcgtggtg ccgagaacag gtgagggaca cctacatgtg gtacgaactg 12960

agcaagatta accccaccag cgtaatgacg gcgatatacg ggcggccggt gtcggccaag 13020

ttcgtgggcg acgccatctc cgtgacggac tgcgtggcgg tggaccaggc gtccgtcagc 13080

atccacaaga gcctccgcac gtccaccccg gggatctgct actcgcgccc cccggtcacg 13140

ttcaggttcc tcaacagcac cacgctgttc aagggccagc tgggacccag aaacgagatc 13200

atactgacgg acaaccaggt ggaggcgtgc aaagagacgt gcgaacacta cttcatagcg 13260

agcaacgtaa cctactacta caaagactac gtcttcgtga aaaaaattaa cacctccgag 13320

atatccaccc tcggtacgtt catcgccctg aacctgtcgt ttatagagaa catagatttc 13380

agggtcatcg agctgtacag ccgcgcggag aaaaagctgt ccgggagcgt tttcgatata 13440

gaaaccatgt tcagggaata caactactac acgcaacgcc tggcgggact ccgggaggac 13500

ctggacaaca cgatcgacct gaaccgcgac cgcctggccc gcgacctgtc cgagatagtc 13560

gcggacctgg gcgatgtcgg ccgcacggtc gttaacgtgg ccagtagcgt gataaccctg 13620

ttcggatcaa tcgtgagcgg gttcattaac tttataaaga gtccgttcgg gggcatgctc 13680

atgatcctgg tgattgtggc ggtcgtcctg atcgtgtttg cgctaaaccg gcgcaccaac 13740

gccatcgccc aggcccccat caggatgatc taccccgaca tagacaaaat gcagccctct 13800

ggcggtaaag tcgaccagga gcagattaaa aacattctcg ccggcatgca ccagctacag 13860

caggaagagc gtaggcggtt agacgaacag cagaggtcag cgccctcgct tttccggcgc 13920

gcgtcagacg gactaaaacg tcgctttagg ggatataaac cgctggaaaa cgaagaggct 13980
caagagtatg aaatgagcaa ataaccacac ccacacgcct gtacttgccg cccgccagag 14040

ccgcgcggcc aatcgattcg cgcacccggc cggtccccga caccctctag cgcccccggg 14100

cgtccgcggt ctgtatcaat catggatttc tttaacccgt acctgggccc tcgcggacca 14160

cgcccccact cacacagagg caccgatgct cccgcccctg ccggcgccgg agccgttcag 14220

ccgccaccag acgtttgcag gctcatcccc gcctgcctcc gaacgccagg ggcaggcggg 14280

atgatcccgg tcacgatccc gttcccgcca acgtacttcg agaacggtgc tcgcggagac 14340

gtgctgctcg ccaacgaacg gtccatgtgg acggcgcgcg accgcaagcc cgtcgccccg 14400

gacccccaag accaatccat cacgtttcac gcgtacgacg tcgttgaaac aacgtacgcg 14460

gcggacaggt gtgccgaggt acctagccgc ttccaaacgg acattatccc aagcggaacc 14520

gtgctcaagc tcctggggcg aaccgaggac ggcaccagcg tgtgcgtgaa cgtgttccgt 14580

caacaggtat atttctacgc gaaggttcca gccggcatta acgtcaccca catcctccag 14640

caggccctca agaacacagc cggccgtgcc gcgtgcggct tctcgaccag aagagtgaac 14700

aaaagaattc tcaaaacgta cgacgtcgcg gagcatcccg tcacggaaat cacgctatcg 14760

tccggttcca tgctctcgac cctcagcgac cgcctcgtcg cgtgcgggtg cgaggtgttc 14820

gagtcaaacg tggacgccgt tcgccggttc gttctggatc acgggtttac cacgttcggg 14880

tggtactcgt gcgcgcgcgc cacgccccgc ctggcggcca gagatgccag gacggccctg 14940

gagtttgact gcagctggga ggacctcagc gttcaagcgg accgcagcga ctggcccccg 15000

taccgcatcg tggcctttga tatcgagtgc actggagagg cgggatttcc gtgcgccacg 15060

cgcgacggcg acgcggtgat ccagatctcc tgcgtcttct acacgaccag ggaaggcgcg 15120

cccaatccgc caaacatact gttcagcgtc gggacgtgcg accccatccc ggacaccgac 15180

gttttggagt ttccgtcgga atatgacatg ctggtgtcgt tcttcgccat gatccgcgac 15240

ttcgaggtgg actttttaac cggctataac atctcaaact tcgatctccc gtacctaatc 15300

acgcgagcgt cccaggtgta caaccttcga ttaaacgaat acacaaaaat aaaaaccggc 15360

tccatctttg aagttcacga gccccgtggc gggggagggg ggttcatgag gtcggtctca 15420

aaaattaaaa tagcgggcat cgtccccata gacatgtacc aggtgtgtcg cgaaaagctc 15480

agcctctccg actacaaact ggacacggtg gccaggcagt gtctgggtgg gaaaaaagag 15540

gacgtatcgt acaaggacat tccccctctg tttcgctcag gtccgggcgg cagggctaag 15600

gtgggcagct attgcgtgat ggactcggtc ctggtgatgg acctcttaaa aatgtttatg 15660

atacacgtgg agatttcgga gatagccaag ctggccaaga ttcaggccag gcgcgtcctg 15720

acggacggcc aacagctccg cgtgttctcc tgcctgctgg aggccgcggc cagggagaac 15780

tttatcctcc cggttccaac gcccgaggga caggggggct atcagggcgc gacggtgatc 15840

aaccccattc cggggtttta cgacgagccg gtcctggtgg tcgattttgc cagcctgtac 15900

ccgagcatca tccaggcgca caacctgtgc tactccacca tgatacacgg acgagacctg 15960

cacctgcacc ccaacctgac gccggacgac tacgagacgt tcgtgctgag cggcggaccg 16020

gtacattttg taaaaaaaca caagcgggag tctctgctgg gaagactgct aaccgtgtgg 16080
ttagaaaagc gaagggcgat ccggcgcacc ctggcggcgt gcgatgaccc gtcgctaaaa 16140

accatcttag ataaacaaca gctggccatc aaggtgacat gtaacgcggt ttacgggttc 16200

accggggtgg ccagcggcct cctcccatgc attaacatag cggaaaccgt gacgctccgg 16260

gggcgcacga tgctggagat gtcaaagtct tacgtggagg ccctgacgac ggaagacctg 16320

cgaacgcgtc tcggtcgcga ggtgaccgcc cgtcacggcg cgcggtttcg cgtcgtctac 16380

ggtgacaccg actccctctt tatcgcgtgc gacggttatt ccgcggaagc cgtttccgct 16440

ttctgtgacg atctggccgc caggatcact gcggacctgt tccccccacc cattaagcta 16500

gaggcggaaa agacgttcaa gtgtctgctg ctgctgacga aaaagcgcta catcggggtc 16560

ctattgaacg acaaaatggt catgaaaggg gtcgacctca ttcgcaaaac ggcctgcaag 16620

tttgtccagg agcgatgccg cgccatcctg gacctggtgc tccacgatcc ggaggtcaag 16680

gctgcggcgc ggctgttgtg caagcggccg ccgcacgcgg tatacgagga ggggctgccg 16740

gctggcttta taaaaatcgt agaggtcctc aacgcgagct atctggacct ccgaaacagc 16800

gtcgtgccca tcgagcagtt aacgttctcc accgagctca gccgccccgt ctgcgattac 16860

aagaccacca acctgcccca cctggcggtg taccaaaagc tggcgagcag gtgcgaggag 16920

ctgccccagg tgcacgatag aatcccctac gtgttcgttg acgcgcccgg gtccctaaag 16980

tcggacctgg ccgaacaccc ggattacgtc agacagcacc agattcccgt cgcggtcgac 17040

ctatatttcg acaaactggt gcacggcgcg gccaacatcc tccagtgtct gttcggcaac 17100

aacgcggaca ccacggtggc catcctctac aattttctca acgtcccgta taagctgttc 17160

tcgtgaacgc caattggaga acgccaacat aagacgccgc gccagcggag tccgcaggga 17220

gagctcgagc ggcgaggagc gaccaacgga gaccgccacc atgctggtta acgaactgtc 17280

ggtggtcctc ggcgactggg aggtgacttt tcaccggggt agattcagct tcgtcaacct 17340

cacccgcctg caaacgttca agggccacgg gggctacgcc agggtccgac tccccttctc 17400

gctcgaccag ttactccacc aacatttcgc gttcggactc gtgacgcgtc tcaaggaact 17460

gccccccttc tccgactgcg tggcccttat cgccccgttg gattccggcg gcgacgcgga 17520

cgcggcgcgc gtggcccccg ggttcgtgct ggactcctct cgcccgctga ccgtgtgggt 17580

aaacgcgagc gggcggcaca cgatccggtt ctgcctcctc tttctaaagc cgatcgacct 17640

ggagcgcgcg gtcacgtacg tcttcggcga gaacggcggc gcgcgctcgg agggcacccc 17700

aaagcccacc tgcgcgaccg aaagcctgcc cggtggaccc ctgcgcgtct ccggcgaggc 17760

gtctcagacg tcgccccatt ctttcgttgc gtattttccc acggccaact cggtggcctg 17820

cctaagcctg ttgcggttac aggtgaggcc gttttcggat gacgcggcgc acagggacgc 17880

gcggatctcc ccgaaatacg tcacgtttag taactccggg ggtaacgtct gcaaggcgtc 17940

cgttcacacg ctgtccccgt cgcggtgtaa aacggcgcaa atggaaatca tctacgctcc 18000

cggggacccc aacgccgaga tagtcctggg ccagtccgga cccgtcctgc ccacccacac 18060

cggcggccgc gtattggggg tctacgccga cgccgaaaaa accatccaac ctggaagctc 18120

cgcggaagtc cgggttcagt taatcttcca acagggagcg gccgctcggg gcgatctggc 18180
gtttctggtc acgggcgtgg caccggagcc cctattcgtc gtcaccccgg cactcttgct 18240

ttccggttgc acaacccacc tgcgcctatt caaccccaac ggtaccccca cgactataaa 18300 

aagagacacc cttgtggccg ccgccgcgcc ctgccccgtg gtgcgattaa gctccgccga 18360 

cgacgcgccg cgagacctcg tcgcgtcacc agacaccggg gcgctctcca ttaacgcgtt 18420 

cacaatcccg gtcggtttcc caggggtggt ctcggcggag tgtcacgtgt ccctacgcga 18480 

caacggggtc cacgaacgca tgaaccattg acggcaacga tgggaacacc ggtgcgtttc 18540 

tttcgcggcg agtggcagac ctcgagtcta gtggacaacg gcacgccacg gtacagctcc 18600 

ctggtgtggg ccgccactat tcacgacggc tacctgacac tggtgaacag gtcagagctg 18660 

tgcgtcacgg agaggtctcc gtgtctgccg gcatgcccca gcatcgggag actggtcggg 18720 

aagaggtttc ccggcttcgc ctttgccagc gccactctgg gcgatcgggg aacacgcacc 18780 

gtgttctacg cgttcggtca ccgcgacaac ccactggaca tagtacccgc cgtggtcgag 18840 

cgcgcggatc gcgagctggt gctgcgggtt cacgctccgc aaacaacgcg ggtgtcgcga 18900 

tacggactta aggtattcgt ggcgatcgtt acggtggtgc gcccgcccgg ggtgttccta 18960 

cactttccac aagaccgcgt tccgatcgcg ctgacagacg cgtgcagcca ggagggctcc 19020 

aggctaacct ctgaagagcc gtggataaaa attcaaggct ttcccgtcct atctgacgag 19080 

accgcgcacc catttctctt aacccagaag accaagccct ttaccgagcg aaagttttgc 19140 

cgcctgatca tggacaacga ccagcgcagc gccgtcaaca ccgtctacct gggaaagcag 19200 

cacgtgaggg tgaccgtgac ccgccccccg gaaacaatcg tcaccgacgg ccccgtgacg 19260 

gcgaccctgt ccctcaccgg taatgcgcca atcgcctttc gccacaaccc atactttgaa 19320 

ctcccgtggt cgtccacaac ggcgatattc acgcccgtgg tgtacgtggg cctgaccgtg 19380 

tgcatcccac ccaactgtag caaattcgta aggtacggta acacctacgt ctcggcattt 19440 

aaccgcaagc tgacggcgat tattagcaat cacgcccaca acggcgggtt ccggattcag 19500 

gactgcgagt ggccaccgaa ccgggagata gagattttgg taaccaacgt gtcccaggcc 19560 

ccggtgtaca tcagcaccgg gacgcagctg gggcaagcca tcttcgtgtt cgcgccgcgg 19620 

ttcggtggcc cggcgaaact gcggcagctc ctcggccacc gatcgcgcgc cctggagctg 19680 

ccgggcgggg tgacagtgga cagccaaaaa ctgtgtaggt ttgagaccat gtacctgttt 19740 

tccacgtaaa ttactaataa accgtttgct cgtatcgctc acacaacgcc aaaccgtctc 19800 

tcattctcgg ggtcgcgcgc ctcgcgaaca cacaaggtgg ctcaaacacc cccctcccgc 19860 

accctcgcca cacaaaacca gttaacgcct tccgttagat gcagtttatt tattatttta 19920 

ttacatcata gctattgcgc ggcgcccgtc ccgcaaaaac atctgtagat attccagtat 19980 

gcgaaacgcg ctgagaacaa cgtcccgggg gctctgccct cccaacgcac gcacggtttt 20040 

ttcattagac tccgcgccac ctatcttgtt atttacggga agctcctcaa ttagagagtc 20100 

gagggcggac agcaccacgg tcaccgcgga gccaatggcc gcggcgtccg cgtccccggg 20160 

cgcgtcgtcc agcagcatcc tcagtcgcgt caggtaactc tcgtaggccc ggagcccgcg 20220 

aaccatggcg ttcatacact cggccccgcg aaactcgcgg cgctggcact caacgttaga 20280 
catcaatata gggaagtgaa caaacgacat gatgcgcggg tactcacgca cccgacagag 20340

agtggagtgg tggcacaggt aaacgaacca gcgctgtatg tttaacccca cttcagacag 20400 

gcgccccctg gtcctgtcaa gagaggcgcg gttgccggcc cactggggga gaacgttaat 20460 

tccagcggca gtcgggggag gcgccagcgt agggctggcc gcccaacacg acaggtaaaa 20520 

caagacgaac cagacaggga acatgaccgg ttaaattact cgcttacaat cgcgggcggc 20580 

ggccggtcaa cgccaggtcc attaaaaaca caccggccgc caacccccaa cgcggggccg 20640 

cgccctggaa cgcggttcct tccaatcgca aagaaccgcg tcacaaaaag ggctcgtttt 20700 

gaacccattt tgtgccatcg ggtttcgttt tcagatacgg aaacggctcg tccaaaaaca 20760 

cccaacgggg gtgtgctcaa acggccatct ccatctctat gtggggatgg gggtcgtagc 20820 

cctcgagact cagatccgcg cgcgtaaagt cctccagacg cgccaccttt ctcaaaatct 20880 

tcagccgcgg aaacggacgc ggggtcctcc gcagctgaag cagcagggga tcaacgtggt 20940 

tgttgtaaac gtgggcgtca cccaaggtgt gcacaaagtc tcccggggtc aggcccgtga 21000 

cgtgagcgat cagatacgtc aggagggcgt agctggcgat gttaaacggg accccgaggc 21060 

ccatgtcggc ggacctctgg tacagctggc aggacagctc cccccgagcc acgtaaaact 21120 

gacacaaaac gtgacaagga gggagagcca tccgcgcgag gtccgcgggg ttccacgcgc 21180 

acataacgat gcgccgatcg tggggccgcc tgttaattag atccaccacg taacgcagct 21240 

ggtccacccc ctgaccctcg tagttggcgt cggcccccct gtactccgcc ccaaaatgtc 21300 

tccactggaa cccgtacacc ggccccagat cgccctcgcg gcggtccccg aagccctgcg 21360 

ccgccaaaaa ggcgcgggac ccgtgcgcgt cccaaatttt tacgccgcgg cgcgacagtt 21420 

cggtggagtc ggtggagccc ctgataaacc acagcaactc ctccacgacg cccctccaaa 21480 

acaccctttt ggtggttaac agaggaaact cgtccctgag gttatatcgg gcctgaagcc 21540 

cgaacacgga cctggtgccc acgcccgtcc tgtcctccct ctgcacgccg tgtttaataa 21600 

ttaaatccag gtgcgccaag tactgcagct cgccgtggtc gccgcggcac gcgcagcgga 21660 

cggccgcggc accttcgggt gcggaaaaaa cccggagtcg tgcggccgcg attccagcca 21720 

cgcaaacggg tataattttt ttaacgtagc agataccgag atgcaccagg acaatcataa 21780 

ttacaaccga ggtgctttac aactataacc gtaaacggct gaagacgaaa cttattttaa 21840 

aggcagcttg ggcggggccg accacaggcg cggcccggct gttagcgagt aataaaccca 21900 

cgtggtctcc ccaagcgtct ccccgagcgt gagcgtctcc ccaaacgcca aaaacaacac 21960 

agataataaa ataaataaca tgtttatttt ttataaactt aacgcgcgat tatgcttctt 22020 

aacaggggcc aatagaagca acgtgcgagg cggcattcgc gcgctcaaag gcaaccgcaa 22080 

caccggcgct cgtaacagcg taaacaaaca actattagcc attcgtaacc gtaaaatcca 22140 

accctctgcg tcccgagcca ttcgcccagc tgcgtctacg tcgagggtgt ttctaagtcg 22200 

cattttgagc ccttggcacg gccgcgtctg cgttaatggt gtttctaaat cccgctgcca 22260 

aggccctctt cggaatcgtc aatcaggctg cgccgggttc ttctgccctt tttgggacgc 22320 

ttgtccaccg cctgaagcag tttcttcact gcgtcgtccc cgggattggc acacagcttt 22380 
ttaccgcggc gagtgtttaa tatcacggcg tccaccgagc actgcgacga ggtgtgggag 22440

taagagacca ctaaaccggg tggcggcaga tgagttacat accccaaaca gcagagttcg 22500 

ggcgcgggtc cgctcatgga gcccatagga aaggcataat caactacaca cgcgaacacc 22560 

gcaaaaaaaa cgcacacgaa aaggcccctc atggtgacaa aaacacaaaa caaagacggg 22620 

gtcgctcgtg cgttgccacc agcgccacgc tcgccgtgct cggccacggt tgacggcgtc 22680 

cattaatacc cgcggcccgg gcccgcctcg acggaaacca ggcgtggtta gccacctgac 22740 

gcacgtgcct cccgtccaga tgtggcattg cgagtcccaa aacgggcgtg tctaactacg 22800 

gtcgccccgg ggttcaaaga acggggatcc cccaaacagg taaaaagctt tttgtccaaa 22860 

cagaaagcca acagcaaccg aaaggaatct ccaaaacaag caagggtccc gtgccggcga 22920 

taaacggaaa atttccctca cctaaacaca cgcgccgcgt aaactgttaa aacacgcaat 22980 

acttcctagg cctgtgttta ataaaacaca caagggttgt aatttcgcgg ggccctttgg 23040 

gccccgcgcg tctgggggtg cgttgtctcg ttgtggttcg caatcaccac cccgcctgcc 23100 

cgaaaacagg gcaacaaccc cctggcctag ttttttaaaa acttaacacc ggcaagggga 23160 

gagggagaag gggtgcggct aaatgggctg ttagcagcca accgcagcgg ggaaaaaagg 23220 

gggcaaacgc cgggttaaca caaacagcaa caggcgccaa acccaatata acacaacgca 23280 

tattccgtgt gtttcgcggg ccggggtcgg tacggctaac ccgcgttggg cgtttgcaca 23340 

agcacgcccc cgccctgcac ccggctcccc gaggtcaccg aagggtcagg aaacatagtt 23400 

ttcacggtcg ccagtcgcgc gcgtggcaac atctttcggt gccctccgaa cgacccggaa 23460 

aaatccccgc cgcgccgggt cgttcggagg gcaccgaaag atatccccaa acgcaaccta 23520 

aagcatcatg tttggggttt cggtgacgcg cgccgcagag gaggccggtg gtggcgctgg 23580 

cgaagatagt ggccgcagag cacgggtttc ggtagtgacg cgcctggacg gctgccacct 23640 

ggcagcaggc caaaaacacg ggcgccgcga gcggccggca ggtggcgccg cgctttcccc 23700 

cacgccgcga tttggcaaaa tggcggaccg gcagcgccgg ttggacgcgg cggacgtcaa 23760 

tcacgacgga gagaagcgca gttggtcagt tgtcccgtca atcaccgagg gccgcgccgg 23820 

ttggcggata atatgaaacc ggcgcggcca ttggacgctg gcggcagcca atgggcgcgt 23880 

tgggttttgt ttacaagttc cctatatata ttatattata ccttccctgt taagggaatc 23940 

ccatgttctt aaatctaatc agctgggtta tataattaga atatcagtaa ttactattat 24000 

tactgtatta tttattaaat tatttatttt attatttatt taataaatta attcttgcgt 24060 

tattatgtaa tcagatactg ggaaatctac tcccagttta ataaaaataa ataattaatt 24120 

ctattatggt gggggttcta aatattgtac aaaacaacaa tattttttaa atttattcac 24180 

taaaaatgct cccattttac ttgtgacgta catgttagga atgtgggagt ggtttgtaac 24240 

tatgttttca aacaacaccc ttttgggccg tctgaaactc atataagcgg aaggctcggc 24300 

gttaattgcc actggccgct aaccagtcca ccttgccagt tggagtttta ttgctgctta 24360 

tggctttacc tcgcattccg ggcacaagct gatttacgta gcacgcattg ggggtttaac 24420 

ccggttgaat ttagtggttt gcacgtgtaa aaaagcccac cgaggcttgg gtttaagcgt 24480 
ttaaagcgtt gctggtttta taacatgcga tttaattgct atgggagcta gcttgtattc 24540

cgcatgcttt cggtttagta gttgtacaaa acagaattgt agcccgcaaa ttgttactgg 24600 

tacccaaaac ggcaagcccg cttaacacgc cccgcgctgc ggcttttgtt cctagcccgc 24660 

tcgcaataca atcgagattt taatttaaaa tggggcaatg gcgtagcgca aatttaagcg 24720 

cgtcaaaaat ttaaaatgga ctaacgtaac cggtgctttt agtgacctat ggcgaatttt 24780 

aaaatttaaa gccgtgggtt ttaacacaga gctgccagct tgtaatacgt tgcatataca 24840 

gtgccgttgt gattttatgg ttaagtggat tttaacattg aattttgtaa tggtgtacac 24900 

gtggattttt aaattgaatt taaaattgta attacgtgcg cgtggatttt aaaattgtat 24960 

ttatgtaaca ttgttaaatt ttaactgcgt tatggttgta ttttggttaa tttatgtaat 25020 

tttgctttta gatttgttcg ggtgtgaatt ccagacaggt aagagcctag ctcctaatgt 25080 

ttgccttgcc gcctagctcc taatgtttgc cttgccgcct agctcctaat gtttgccttg 25140 

ccgcctagct cctaatgttt gccttgccgc ctagctccta atgtttgcct tgccgcctag 25200

ctcctaatgt ttgccttgcc gcctagctcc taatgtttgc cttgccgcct agctcctaat 25260 

gtttgccttg ccgcctagct cctaatgttt gccttgccgc ctagctccta atgtttgcct 25320 

tgccgcctag ctcctaatgt ttgccttgcc gcctagctcc taatgtttaa caacattaat 25380 

gtttaagcac actaaaattt aaagacgttt gtgttggttt ttatgaccag cttggtacaa 25440 

aacctgctgg tgatttttta cccaacaaat aataaataaa aaagttaaaa cttatttctg 25500 

ttgtctgtgg taattgggtt ccccgagggt cccgggctcc ccgttccccg agggtcccgg 25560 

gctccccgtt ccccgagggt cccgggctcc ccgttccccg agggtcccgg gctccccgtt 25620 

ccccgagggt cccgggctcc ccgttccccg agggtcccgg gctccccgtt ccccgagggt 25680 

cccgggctcc ccgttccccg agggtcccgg gctccccgtt ccccgagggt cccgggctcc 25740 

ccgttccccg agggtcccgg gctccccgtt ccccgagggt cccgggctcc ccgttccccg 25800 

agggtcccgg gctccccgtt ccccgagggt cccgggctcc ccgttccccg agggtcccgg 25860 

gctccccgtt ccccgagggt cccgggctcc ccgttccccg agggtcccgg gctccccgtt 25920 

ccccgagggt cccgggctcc ccgttccccg agggtcccgg gctccccgtt ccccgagggt 25980 

cccgggctcc ccgttccccg agggtcccgg gctccccgtt ccccgagggt cccgggctcc 26040 

ccgttccccg agggtcccgg gctccccgtt ccccgagggt cccgggctcc ccgttccccg 26100 

agggtcccgg gctccccgtt ccccgagggt cccgggctcc ccgttccccg agggtcccgg 26160 

gctccccgtt ccccgagggt cccgggctcc ccgttccccg agggtcccgg gctccccgtt 26220 

ccccgagggt cccgggctcc ccgttccccg agggtcccgg gctccccgtt ccccgagggt 26280 

cccgggctcc ccgttccccg agggtcccgg gctccccgtt ccccgagggt cccgggctcc 26340 

ccgttccccg agggtcccgg gctccccgtt ccccgagggt cccgggctcc ccgttccccg 26400 

agggtcccgg gctccccgtt ccccgagggt cccgggctcc ccgttccccg agggtcccgg 26460

gctccccgtt ccccgagggt cccgggctcc ccgttccccg agggtcccgg gctccccgtt 26520 

ccccgattca acgttaaagt ggtttattta gcaaacacac gttttttgcc gcggtttggg 26580 
ttgtacatac acgcgggtgc gcattggccc gcccatgcaa aaaaataacg ttgggtttcc 26640

gaaacgcgta ggtgctttag tgtttcatgg cacgttttcg tgtcacggta ggagtggtta 26700 

gttttttggt agcgaatgca accaggtagc tcgcctgacc gggaaaggga acgttttaaa 26760 

ccgcagtgtt accgcggtgc ggcatacggt ttaacgtgca actgccggtc ccgcgcgtgg 26820 

taccttaagt ttacgtgtag caattatggc ggctgttcag ggccctccgc cgccccctga 26880

ggaagaaaat gaaaattctt tgccagttga cgtttatgct atagagggca tctttcttta 26940 

ctgtgggctc gggcaggcgg agtacttgca ccatcccgtg tttagcccta ttaaggaatt 27000 

tatcagcgcc tttctcaagg acagcgctcg cctgtacgag aggctcttgc gccacaccga 27060 

ttaccgctct ctgcggggac taaacgccat aggccaaggg atgctgcaaa taaacacgga 27120 

cggacgccac aactggggtc gcgctttggc cgtgttgggt cttggtgcgt atgtggtgga 27180 

taaggttaaa gacgacgagc gtcttttaac gttcgccata gccgttctac ccgtgtacgc 27240 

gtacgaggcg ctggagtctc agtggtttcg ttcacacggc gaatgggagg gactcaggaa 27300 

ttactgcgag cgaatactga ggcatcgccg caacgcgagg agacacatgt gctacggagt 27360 

tgcggctggt cttctggcgc tagtggcgct gtttgccatc aggcgatagg tgtgcgttta 27420 

accgcgtatt cccccaccct gaccttaagt gagccgattg tatatgagac caataaaaca 27480 

aaacaagcta acgtggtatt cgttggaaca ttttttattt gagcagttcc tcgcagaaca 27540 

ttttttgtat ctgtgacacg ggggccggct gtgcgctggc ctccaccggg gcccgcgcgg 27600 

actgtccgca ttctggatcg aggggcgggc acgcgccagc gcggggggca gaggctgaag 27660 

gaatggcgtt tgacattaca gattcttggg cggcgggggc tggagttgcc ggctgcgtgg 27720 

cgagaccggc gtggtggctc gccggctgcg ggggcggtgg ttgcgtttgc tggatgccct 27780 

gcgcgctctg gtacggctgt ggggcgtgga gatgggtgtc cgttccggaa taccactgcg 27840 

gctggtacgc taggtacggc gcagacgcgt aggtctgtgg ccacgagaac tgaacgggcg 27900 

gtggttgcgg tcgttggtcg gcgtcctgtg cgccggaggt ctgcgacgcg gctcgtttta 27960 

ggtccctcag atccgcctga atgtccaata tgtttttgga cagcgccatg acgtccttgt 28020 

gtatgccgac ctcctcgcct ggaaacacgg gctcgtcgga aagctccacg tcccgtttgc 28080 

gcttctggtg gcgagacggt gccggagcgt agccgtacgt cggagaatag tacgtggccg 28140 

catacggaga gggttggggg gcaaacgccg actggaaagg gaacggtggc gcgacgggcg 28200 

gtgcgtacga tgggacggtc agctcaggcg gcggtatcca cgatggatac gccgccgttg 28260 

ccggcatcga gggtggcgca gaaacgtacg tgcggtggcc ctggttgcgc atgttatcga 28320 

gactgctctg caccatggtc agaaaggtgc tcttgggaat ggtgatgttg tcctccggga 28380 

gggcgctcat ggttcgggtg tcgcggttac cgcagtgctg agcgcaggga aactggcttg 28440 

cctttaaata cgtacttctc gcgactctgg ccacgccttt atctgttttg agcaggtctg 28500 

tgcgatctcg gataaatccg gcgtcgatcg ccttggccat cagtgtctca agaggggccg 28560 

cgaattctgg cggtgtcacc tgacgttcgc aaagatctaa acaattgacc gttattttac 28620 

cggcttcctc tcgcgtaaga gaatcaaact tggagacgac ccatgtaggg tctggtccgt 28680 
aaacggcgat tgacccccta cgcctcccca gtgcgcacag agagacgtgc tgaaacgcgg 28740

gtgggtgatt ggggtcttgc agctcttctg ggtggaggga cgagagggag agctccggga 28800 

gccatgtgtg taacatctcc aacagggggt cccgaggcat ggggtttagc tggatctggg 28860 

ccgctcgcga gtcccccgcg agccgtgacg ccagggccaa gaaatcgtgg ctcgttagct 28920 

tgcccaggca aaatatcccg tgtgttactt ggaagagacc gattgtgtga cctacgtggg 28980 

cttcggggac gtgctctatg tttatcggta gagggtccgt atacgggagc agggtcgcca 29040 

cgattgaggg ctctaaatac agctcctttt ctatctttgg taggctgacc acgtccacgt 29100 

atcccccaac gtacacggga gtcatgttgg gtcggtgtct gcgtgagccg aaggagatga 29160 

gcgccggtct tcgggggctg atgtggcgcg ttatccgttg tgagaattta aatacgtttc 29220 

tccctggcga gctcaggttt cttcacctgg tgctgtgcga aatgtacaac tacggtctta 29280 

acgtgtacct tctgaaggag gcaattgcca ataccggaac gagggacgat attgtcctcg 29340 

ggcgtaaggt tccggtggag ttctggaaaa tcgtttacga tgggttaaag gagatggggg 29400 

tttctgacgc aacgttgctc tcagaaacca aaaggggggc tctgtggctt tacttcaacg 29460 

ggcgtccgtg cctgttaaaa ggtctaggcg actacgtttt ttgccaactg gggttaagcc 29520

acagcgtacg cgtggttccg gaaaacctaa cggatgggaa ttatctctat aacctaggca 29580 

gcgtcatacc atgtcgcctg cttgtggcac tgagctactg tttggcgttt tgggggcacg 29640 

ctgaccacga gccctgggtc aggttgttcg ccggcaagat ttttatactg tacttaataa 29700 

tttccggcca cattatgcca cggaagtcaa tcttagaaca ggtgggaacg tccgggtacg 29760 

gtggcttcgt cgaggcagtg tgtcgcgatg ttcgcgcggt acacgggatt ccggcgtggg 29820 

attttgcgtc cgcggctccg gcgttaacgt ctcaacagac ggactacttg tttgcgttca 29880 

ataacagcgt agtgtagggt cgcctcacac taccgacgtg gcgaccggca ggtatcccag 29940 

gaccaaaaaa taaactctgt cgtagtcgtc gtaaactccg aattggcttt tcattatctc 30000 

gtgtgccaga ccggggggcg atgggtgcga aaggatggcg gccaggcctt cctcggccgt 30060 

taaacgtagc gacgtttgag cctgaatcag gctgccagcg tccctgaacg tgagcttttg 30120 

gttcaggatg tcaaagattt cgttaaatct ggaaccggtc aggttgacga acggacgagt 30180 

gtgttttaga tgggatatgg cctttgcctc cagcgccgct agatacgcac ccgggaacaa 30240 

aaacgacgtg ggggcgtctg gcgcgcgcga aagaattggg gagatgtaat tttctgtcag 30300 

gaaatcaaaa agttgtccct ttttaaggta ttgtttggac ccggacgggt cttctacctt 30360 

ggaaaatctc ccacccagga gcgaggccag gttgaggcgc cggtggggcg atctgggtgt 30420 

gatttgggcc agctccaata gcacgtaaac caaaactaaa cagcctagtg ccgagaggcc 30480 

ggctcggagg ttccattggt acgcttgaaa gggaggcacg tcttcgctga acaaagtggt 30540 

cgcaaggatg tatacgagat ctggtccagg tggctgaccg gtaccggggt tcttggaacc 30600 

gggtagcagt ccgctctgtg ccaacgtcgc gtatatcacg tggtttgtaa agaagtccgc 30660 

gttgtacttt gtttctttgt ttaggggcgc tagagaggcg cgttgcctcg gattcgaata 30720 

tgtaaactcg aaacccaggc ccttcgtgtt ttgttttagg tgagagacaa gcgtcgccag 30780 
ctttgtgccg aggttatccc agaggccgtt tcgagttggc ggcaacgggg tacctgtccg 30840

gtgacagtag tacccgttga ggaggcacag cgcggcctgc gtgtccgata cgttgcggac 30900

atccgcgtaa aagtcttgtg tgatcgaggc ggctgttgag attacggcag acattagctt 30960

tagagacgtg gagttttgaa agtttgttgt gcctcgcaaa tctctgggga acaccctgcg 31020

ttgcatggcg ttatctgtga ggctactgta ccacggcccg tattgtggca gccattggtt 31080

ttggttcatg tacaggcttg gcaggagctc cagacggaga tcctggttta ccgtgaatcc 31140

cgggtctccg ggggctattg tgacaacgaa cgcgtctcca tgtgtctgcg gggcgatttt 31200

gggttgcggg tttagtgatg tttgggcatt ggcggtttcc ctgcccgggg accgcagatc 31260

cccgaccatg ttttgaattg tttctaggtc gcgcatgacc acttctgttt cccgttggtg 31320

cgtttgcagc acgttgtcta gttctagccg gagcaggttt tttcgcagcg tgtgaaccct 31380

tagcgccagc tgttgatttc gcatcacaaa cgcgtttctt ctttcttcta tcagagatgg 31440

tttgtgaata aacagaacgt tgtttggatg gggggcccaa aaagttgcgg taatttgccg 31500

cgctggtttt cgtgggtatc tcatacagca tttttctgac gtgcgcatct tcccattggc 31560

taaaaaaccc gttaagcgcc gccgacgtgg tgttgataaa tgtaggcgga aacgctgatg 31620

actcagtatg aattgtttta agtgttttct ggttcttaaa tagcagccag ggccttaccg 31680

accagttttc gtgtcctctt ggcgcggctc tgcctatata tgccacggaa tctgtcagtt 31740

ggtgtaagcc ctgccatcgc tgtgatattt ttattgcgtt gccgggaaat acagccgcgc 31800

ttaggcacgt cttgagttca attacgtagc acgttttttg atgttcgcta ttaacggtaa 31860

gcacacaatc tgctattcgc ctacctaggg aaacctcgaa aaaaatccta aatttaacgt 31920

ttttggttgc cctgagtgtt gggcgaattg aaagaaactt taaaatatca tcaagcgacc 31980

ggtgtttgat taaccgtcgg tatactgcta agtgggcgcg agttccagcc cgtttgcgtg 32040

aggccggaag ggcttcgagg gtcgcgacgt gcttacattg atttgcaaac gccatggccg 32100

aaggagggtc tggctttggc gacgagcttg tccgtcagat gcgggatcgg aagcctagat 32160

gggacgagtc ttccgacgat actgatgacg ttgacaccga aagcaccgat cttgagtatg 32220

atgatgtgtt ccccgtggta gatacgcacg gcttaatgag ccctggaagt caaaactatg 32280

acgtacccac gtctccgtcc ggaacgccgt gggaattact gcacccggac gccctgtatg 32340

cacatccgag gtgcccgcct aaaagggcgg tggtgccggg cggtggcgcg cgtcccaagg 32400

tgtccgcgtt ctcggctaga cttcaatatg ttggacgaca gagttttggg gatagagaga 32460

cgcggcagct taccggggct cagttttctt ccgagagcga gcacgaatat gcggaaatac 32520

cggagcgtac tactacacgc ccggttgaaa gcggtgacaa gagaaatttc acctccgggc 32580

gtaggggtgc aatctcggga ccctcgtcga caaaacctag tcacggtgcg gggttaacac 32640

gtaagactaa aacgtcgctg agtgttagtc ttaaaaactt gctgcggata aaggacgatg 32700

atgctaaggt tgatgtaccc aggccggtta ccgtgccggt tcatctcatg cagccgcatc 32760

ctatgacgga atatagaaat gcctttttaa tctatctaga gggggtcatg ggtgttggaa 32820

aaacaacgct attgaattca atgactggca tggtgccgca ggaaaacgtc ttaagctgtc 32880
ccgagcccat gaaattttgg acgtgtgttt attcaaattg ccttaaagaa cagcgcagca 32940

tagttaagca aggcacccac gggaaattga tcacttctgc tcgcgtatac gcgtgtcaga 33000

gcaagtttgc gctaccgttt cgtgcgacag ccgccggcat cggtcgcaac ctgcaaccgt 33060

ggctggtggg aaacggtagc acaaagccag cgaattggat tgtttttgac agacacctac 33120

tgtccgcaac ggtggttttt ccgctggttc acgtaaagta caacaggcta acgccggatc 33180

atctgtttca aatcctatct cttttctcgg cacacgacgg cgatgtggtc gttttgctaa 33240

cgctcaacag ttcggaggcg cacaggcgca ttcaaagtcg aggccgtaag gaagaaaaag 33300

gaatcacgca aaactacttg cgacaggtag cgtgggcgta ccatgccgtg ttctgtacgt 33360

gggtgatgat gcaatatctc acaccggagc aaatggttca gctgtgtgta caaactgtgt 33420

ccatagagga catatgcaat atgaattcca gattgactca tcggtttcta accctaacaa 33480

aactacatga acagagtatg atacccatgg tggcagaaat gttagtatcg gttaaagaac 33540

acgtgacctt aatggaggtc tgtttgggac tctttaaaga gctacgaaag cttcaaattt 33600

taattgttga cgcaggagaa catctagatg atgcgtgtgg cctctgggga aatatttatg 33660

ggcaggtgat gtcaaatgag gctattaaac cacgagcagt gaactggcca gctcttgaaa 33720

gctacattca aacgctaacc aaattggaag gcaatggcgc gtattagttt tatatttttt 33780

tttactataa taaggtgttc tgtaacggac aaatacgtgt atgatgaaaa atcaaatgta 33840

gagttggaat ttaatggaac aatttaccaa attaattgga gaaacgtaag caaagaatta 33900

acatctattg taatggaaga tgcgtggtac gattctttac tgctggaacc attgtctgtg 33960

actttggaaa aaagaaaaag tttgctgcga tcaagtatag ttaatgtgca taataatgac 34020

tatacatttt gtaaatcttc aagtgaccat gtaattaacc ttacggtgga ttttaattac 34080

agttctttgc caggttttac tgggaatttt aatgtaatga cacatgcttt aacacaaggc 34140

gttttattaa cgaaacgtga attatttacc aactcaacaa acattatgga tcttttttac 34200

gcggaaaaaa taaatgcaga aatgttcaag attacttttg attattccaa tgtaataatt 34260

tccggcatta taacagaaaa ttggatatta gtatcagtta caaattcaag tgtaaaatca 34320

aatatgcaat gtgttgcact gttgtttggt gttcctagca catttccagc gttaaaaggc 34380

tatgtatctt atcgtgattt attggtcgtt aaaaattcaa actatgctct gggtgtaatt 34440

gcgccgaaat cttataatac tcttgatttg gcatttttac caaaaaattt tacagaaatg 34500

tttgtttcag ttatagatag tccgcttaat gccattgatt atctcaaagg aaaacttttg 34560

gctattgaag caaagggtgc ctgtcaaaat ccttcaaatg aaaatgatat cttatctttc 34620

ttttttgaag ttactgcagt aaatttttta tttataaaaa atcttcaaaa gcagcaatta 34680

gttaacgttg ggtgtgttgt acggcatgtt gcagcgttag aaagtttaat gcatttatta 34740

aggctctgct acccaacgtt taagttatat gagctaaatt tggaaacgtt gtcgcatatt 34800

gcagagtccc aggtttttaa cttgccggca aatagcatgt tatcgttatc agtaaatgat 34860

caagaggtcg tattttcaat gtttaaaatt gtatataata cacctaaggt tggtggtaaa 34920

attctaaacg aaattgtata tattactaat tacatgtaca ccaaatacag tgaaaattat 34980
cagctaacca acacatttag gcggaatgta atgaacatgt atgaagtttt aacaacaatt 35040

aaattaaacg tcacagattc ttctgtgttt tatccgtata tattgttcac atccatgtgt 35100

aataatgtag agatatcata tatgattaat caaatagcaa agcccgatga catcaccata 35160 

tttcgcgtgt tttcgccttg ttttttaagc ttaaggtttg atttagatga aaacaagcta 35220 

cgatccgacg caccgcagac gtcaaaaaga accggctcgg aactggcaca aggagcgtct 35280 

ggtttttggc ggctgttgca cgcttttcac gccacgcgta taaacgagtt ttcagtcata 35340 

aattgtacca ggctggcatg gaagcaagta accgcgctga tgccgcttac gaacatcacg 35400 

tacgtcataa gctccgtgcg ccctgatcac gctcgcgttt acgaggtttc ggaggtgttc 35460 

cttaacagcg cgatgtttgt ctcggccgtt tatccaaatt gttctcattt tacaccgccc 35520 

ggtaccgccc tacacattcc gatattgtat aacttttccg ccccgaggat aggatgtccg 35580 

ctgtgtgatt cgattgtttt gagctatgat gagaaccagg gactacagac catgatgtac 35640 

gtgtcaaacc ccacagttca agccaacctg ttttccccgt attctccttt ctttgacaac 35700 

gataattttc atatccatta tttgtggtta atgaacaacg gtaccgttgt ggaaattcgg 35760 

ggattgtaca gaagacacgc actcagtgct attgcattag tatttgcttt tattggaaca 35820 

atgtctgctc tgtattttct gtttaagctg ttttccatcc tggcctaaac tgttaataaa 35880 

gcgtaaaact ttaaaaggtt gtttcccgtt tctttttgga tggttacgcg gggactgaga 35940 

ttagcgagaa cctcggatag aggcgtgccg tcacatattg attcaatcac ctctgcgcca 36000 

acttgaatgg ccagctctcg attttgggga ggaaacatta aaaaatattt gtcggtcgcg 36060 

gttaataatc gctgcggagc ccgcgacaaa actcgaaaat ccttctgaaa gtcgtgggac 36120 

tctatggagt ttaaacaaaa caccatgtct tccaggcgct gttttacgcg cgagagcatg 36180 

gattcgaagt attctctctg atacgggccc gccgccttac acgtattcgc gtatagcatt 36240 

ttaaagtgaa acagaatatt ttccaaaccc aaacgctgca ataggacgtg ctttaaaaaa 36300 

aggtgttgga taggctttag ttttaaatta ggtaaaaaca ttatagagtt gttggcgcgc 36360

tgcataattg tcacggtgtt aaacgcggta atcgggctct cgcctatttg tggctggcgc 36420 

gaacccatgg caaaaaatat gtctgccagt aaacagtccg actgtacatt tgactcgctg 36480

gtcgctggcg gttgcttgag gatggatcga agttttttat tatccggcgg aagcgcgcta 36540 

cataaggtcg gtggcgcatt tggatcgtat tcttgcgtgg tgtttgccag ctcgtttatg 36600 

gttatctgtt tgttatttgt aagagttacc ggtggggtct tgacggtcgc cgggacggaa 36660 

aatggcctgc agtgcaaaaa aagagagtta ctcgcaatca ctaattttga gtttacggta 36720 

attggtgtgt cctcaaagtc agaacttcta attactgggt ctctgaacaa tgatacgcct 36780 

ctctcgcctt cgtccgttcc cacgcacaag cagctgttta gcgcgtaggg atggggtgac 36840 

accataattc gcagcattga taattccgga cgttttgggg cgtacaatgg catcatgttt 36900 

gaatagatgg tgctgaattt tttaaacaac aggggaaggg gtacggccgg tcccggaagt 36960 

ccgaccattg cggcaagtgc cgacgagtac acgacatgac agacgtactc gccgttggat 37020 

agctgcaccg cccccccaac taagcgagcc tttaaatctg agatttttat cattgatttt 37080

cctttggtag tttcaggaac cgcagcggcc ttggcttcgc ggttatattt ttgctttatg 37140

tctccagtga agtcccaaac gtctaatagt gtgtcgaatt acctgcggtt ttaagtttga 37200

aaacgcgtga ccggtcgtag ctatgaaaga cataaaattg gcgttgcgtc ctaactcgtc 37260

taaccgtatt tttttacttc gcacgtatat ctttggaatg aggttgcatt ttttatatcc 37320

cagaatagcg cattcgtgtt tgcttttggt gtgactaatg atgcgttttg aaaagtcaaa 37380

cgtaccaagg cgggcggcgc ggttcagggc tcccagatcc aaacgcaaaa acgaacaaca 37440

ttccaccaaa cactcgataa aacgcttgtg tagagtttgt atttgcgctt tatttcttgt 37500

ctccagcgtt gtgaatattg tgtttttcac ctggtctatt ctagcatatg gaacgcttaa 37560

tattgggttc aattcatccg aaagtttttt aagtaagggc ctaccgtata ttttagaaat 37620

tgtcgcagtt gccgccttaa accaggaaac gttgcatccg tctttactag aaaccacaga 37680

agggaaatta gtggtccaga aaacgtctcg ccgcccccgt ctaactatgt attggttttc 37740

aagcccgtgg aggtctaacg tagagttcca gttcgcaatc gaaaccggga aaatgtgctt 37800

caccggtaaa aagcacctga ggtttcccat agcgaacata aagtgcagca tgtccccgct 37860

tatgttaacg ttaattgcag ccccgttgca caggttgtcg gagtagaaca cactcacggt 37920

gaatgacggt tctttcacgg agaatttttt cattacaaag ttattagtga gcctggggat 37980

gtcgataatt gactcgaagg taccggcgag agaccagcac gtgattttgg tattaattac 38040

catattggta ttgaaaactg agattttgaa gttgtgtaaa ctcatgtgct taactggatc 38100

gaacatgtct ggaagaggtt cggtctggtg gtcgtgacgc gcttgttcat atcgatctgg 38160

gttttgatcg gcgttaaaaa taaaattgta atctctatct tttggtgcgt attgaattcg 38220

cgaaaacatg gcctctagga cgcgaggcgt ttcgcctggc cccagaaact cggtgttagg 38280

atacagcctg tcgcacgaat ggcacacgtg ttccgccaca gacatgcaca ttgcgacgcg 38340

gaatattggt acctcgggtg gaaacgttag atagtacgtt gactttggcg tcacaacaac 38400

taccttgtac ggttttttgc cgggatcctt tttgttaatt ttcacctgta gatttacgtc 38460

aagcgattga tttactatgt tttggtagtc tatgattaat tcgctcgcca ctttgatgtc 38520

ctgaaatttg ttttgctttg agaggttttt taaaagtacc ggattggatt cgatgttgtg 38580

aataattacc gtttggacgc gatgcagttc gggatgcgaa atccaatccg aagtaaaacc 38640

caaaatgtta aattttaagt aattcagatg cagttttgca cacgggggcg gacaccgagc 38700

tagcgtgagg attttgaatg gctctaaaag tttaaagtaa ttctgataac acgtgccgat 38760

cagatgttcc ggcagtgtgc gcgtaaaggc agcaaagcat tcctcgaatt cccatatttg 38820

aaggtaacga cagaggattc gaaacatgct ttgataattt acgccacata ccattaatgg 38880

cgaacagtac agcgaccgta ggattcgttc gtcgctttga accaaaaccg tgtcggacag 38940

ttcgtaggtg aaaacggtgg gtaaaaacgt atgcgtctcg atatgggaaa atgtacatcc 39000

gatagcatca tcaattatta aacccggtcc cgcgtactcg ttaccggtct tgccaacgat 39060

accaccttgg cggtgtataa aagggtatag tccttctccc aataccagac gccggagcgc 39120

acggcacgcg taaaaggtgg tctctgtttg catagccctg tccaaggcct ctagcgagac 39180

gttttcaaga atctcattat tggtggccag gcagcaaaaa attgccagtt gagttttaaa 39240

atcggaattg gccggtgacg gggcgttcgc tgccaccgta gcaggacacg cgggcagaag 39300 

aacagggccc tgcaacagca taatggaggc ggctctggag gtacgaccct ttccatatat 39360 

ggccacggag gccaatctgc ttcgtcaaat gaaagagtcg gcagccagcg gactttttaa 39420 

aagttttcaa ctgctgttgg gaaaagacgc cagggaggga ggagttcaat ttgagggtct 39480 

tttgggcgtg tataccaacg taattcagtt tgtgaagttt ctggaaacgt cgctagcggt 39540 

tgcgtgcgtt aacaccgagt ttaaggacct caaacgaatg acggatggaa aaattcagtt 39600 

taaggtatct gtaccgacca tcgcgtatgg ggacggcagg cggcccacaa aacaaaaaca 39660 

atacattatc atgaaggcct gcaataagca tcacattggt gccgagatag agctgtcgac 39720 

tgatgacatc gagctgctat tcattgacag agaaacccca ctcgattaca cagaatacgc 39780 

cggggccgta aagacgatta ccgcctctct ccagtttggc gtggacgcgc tggagagggg 39840 

cctggtagat accgtattga atgttaagct taggtccgcc ccgccgatgt ttattctaaa 39900 

aacactatca gacccggtct acaccgaacg gggtctaaag aaggctgtta agtcagacat 39960 

ggtgtccatg ttcaaaagct accttatgga taactcgttt ttcctcgaca aatcagacat 40020 

cgccgtcaag ggaaagcagt acgtgctgtc ggttctctcc gacatggtgg gggcggtgtg 40080 

tcacgaaacg gtttttaagg ggacgaatac gtatctgtct gcatcgggag agccaattgc 40140 

cggagtcatg gagaccacgg aaaacgtaat gcgaaaactg ttaaacatgc taggtcaggt 40200 

tgacgggggc atgtccggtc cggcgtctta cgccaattac gttgtcaggg gcgaaaatct 40260 

cgtaaccgcc gtgacgtacg gtcgcgtcat gcgaacgttt gaccagttca tgaaacgcat 40320 

agtggaccgt cccaacgcgc agcccagcgt cgacgatgat cgggacgcgg tggcggacgg 40380 

gcaggactcc ctcgccaaaa caccgatcgc ggcggctgtt atccagattg gagataaact 40440 

ggtggctttg gaaagtttac agcgaatgta caacgagacc cagtttccct tcccgttaaa 40500 

caggcgtatg cactatactt attttttccc cattgggttg cacatgcccc gtcctcagta 40560 

ttccacgtcg gccacaatta agggggtcga gcacccagcg gaacaatccg tcgaaacgtg 40620 

gattgtaaat aaaaacaacg tactgctgag tttcaattat caaaacgcgc tcaagtccat 40680 

ctgtcatccg cgcatgcaca acccgatgcc gtgtgggcaa gctttgggtc aggcgtttcc 40740 

agatcccgga cacgttcaca ggtacggaca aagatctgag caccccccga acatgaacct 40800

atacgggctg gtgtacaact attaccaggg caaaaacgtg gcacacgttc cggatgtcgc 40860 

cctaaaggcg acgatgacca cagatgagtt gctgcaccca acctcgcacg agacgcttcg 40920 

tttggaggtt cacccgatgt ttgatttttt cgttcatcag cagcctggtg cgcaagccgc 40980 

gtatagggcg acccacagga ctatggtggg taacattcca caacccctgg cgcccaacga 41040 

gtttcaaaac agcagaggcc tgcagtttga cagagcggcg gccgtggctc acgtgctgga 41100 

ccagtcaacg atggaaatta tccaagatac ggcgtttgac acgtcgtacc cactactctg 41160 

ttatgtcatc gaatgcctca ttcacggaca ggaagacaaa tttttgatta attctccttt 41220 

aattgcatta accattgaaa cttactggaa caatgccgga aaactggcgt ttattaacag 41280

cttccctatg ctgcgattta tctgcgttca cctgggcaac ggtagtattt ctaaggacgt 41340

gtacgcccat taccgaaaag tttttggcga actcgtggtt ttgcagcagg cgctctcgaa 41400 

aatcgcgggc cacgaggtgg tggggcgcag gcccgcgtcc gagctgatta actgtcttca 41460 

ggaccccaat cttttgccgc cctttgctta caatgacgtt tttaccaacc tgctcaggca 41520 

gtcctcgcgg caccccatgg tactcatagg cgacgagggg tacgaaacgg aaaatgacag 41580 

ggatacgtac atcaacgtca gaggaaaaat ggaggaccta gtcggtgaca tggttaacat 41640 

ttacgagacc agaaacaacg cggatcatga cggccgccac gtccttgacg tcggtccctt 41700 

taatgaaaac gaacagcaca tggctgtgct ggaaaagctt ttttattacg tggttctgcc 41760 

agcctgtacc aacggtcacg tctgcggcat gggcgtcgat tttgacaacg tggccctggc 41820 

cctgacgtac aacggcccag tgtttgctga cgtcgtgaac cccgacgatg agattttgga 41880 

ccacctggag aacgggacgc tccgcgagat gctcgaggct tcggatatac accccaccgt 41940 

tgacatgatt cgaactcttt gcacgtcgtt tctcacctgc ccgtttgtta cccaggcctc 42000 

ccgtgttgtg actcagcggg accccgcgca actgttgacc actcacgacg acgggagata 42060 

cgtgagccag actgtcctcg ttaacgggtt cgcggcgttt gctatcgcag ataggtctcg 42120 

tgacgttgcc gagaccatgt tttacccggt gccgttcacc aagctgtaca gcgatcccct 42180 

ggtggcggcc acgctccacc cgctggtcgc aaattacgtg acgcgcctgc cggcccagcg 42240 

cgtgccggtc gcgtttaacg tccccccggc cctcatggcc gagtacgagg agtggcacaa 42300 

gtctccaatg ctggcctacg ctaacacctg cccgatgacg cccacgtcgt tgagcaccct 42360 

ggcgagcatg cacatgaagc tgtccgcgcc ggggttcatc tgccacgcaa agcacaagat 42420 

tcacccgggc tttgcgatga ccgccgtccg aaccgatgag gtgttggcgg agaacttgct 42480 

atttagtgcc agggcctcga cgtccatgtt tttagggcag ccatcggtta tgcgtcggga 42540 

agtcagggcg gacgcagtca cgtttgaggt gaatcatgag ttggcatcgc tggacatggc 42600 

gctcggttat tcttccacca tcacgcccgc ccacgttgcg gcgattacct cggacatggg 42660 

cgttcactgt caggacatgt ttctcatgtt tcccggggac tcgtaccagg acaggaccct 42720 

caacgactac gttaaacaaa aagccggatg ccaacgattc ggtggtcctg gccagattcg 42780 

tgagcccgtc gcttacgttg cgggggtgcc gcactcggac aacataccgg gtctcagcca 42840 

cggacagctg gccacgtgtg agattgtttt gacgcccgtt actgcagacg ttacctattt 42900 

tcaaaccccc aacagtcccc ggggacgggc atcctgcgtg atctcgtgtg acgcgtacaa 42960 

caacgaaagc gcggaacgtt tgctctttga ccactccatc ccggattctg cctacgaata 43020 

ccgcactacg gttaacccat gggcgtcgca gcagggctcc ctcggagacg tgctgtacaa 43080 

ctcaacctcg cgccaggtcg cagtgccagg gatgtacagt ccgtgtcgcc agtttttcca 43140 

caaggacgct attttgcgta acaatcgggg cctgaacaca ctagtcacgg aatacgcggc 43200 

ccgcctcacg ggaacgccgg cgaccagcgc gacggacctg cagtacgtgg tggtcaacgg 43260 

aacggatgtg tttctagaac aaccgtgcca gtttctacaa gaagcgtttc ccacgctcgc 43320 

cgccagtcac aggtctctgc tggacgaata tatgtcgaat aagctcacgc acgcccctgt 43380

gcacatggga cattatatga ttgaggaagt ggcccctatg aaaagactat taaagatcgg 43440

aaacaaggtc gcctattagt ttagctcaga cggtctggag ctaacgagag atggccctcg 43500 

ataagagcat cgttgtctcg gtgacgtcta gattattcgc cgacgagata gcaaatcttc 43560 

agtcaaagat aggatgcatt ttgcctctca gagacgccca ccgtctgcag aatatacagg 43620 

cgctgggtct ggggaacctg tgctctaggg attccgcggt ggattttatt caggcatatc 43680 

actatttgga caaatgcact ctcgccgtgt tggaagaggt cggtcccaac agtttacggc 43740 

taacgcgcat tgatcccatg gacaattatc aaataaaaaa cgcgtaccaa ccggccttcc 43800 

attgggataa ctactcagaa ttggtagtta taccaccggt ctttgggcgc aaagatgcga 43860 

ccgtctcact ggagtctaac gggtttgatg tggttttccc tgccgtggtg ccagaaccac 43920 

tggctcaaac agtgcttcag aagctgctgc tgtataacat atactacaga gtggcggaga 43980 

cgacgcccac cgacgtcaac ctagccgagg tgacgctgta cacgaccaat atcacttaca 44040 

tgggtcgcaa ctacgccctg gacgtggacc ccgttgggtc gagctcagct atgcggatgc 44100 

tggacgacct gtccatttac ctgtgcgttt tgtccgcgtt aattccgcgc gggtgcgtaa 44160 

ggctactgac ctcattggtg cgccacaaca aacacgaatt agtcgagatt ttcgaggggg 44220 

tggtgccacc tgaggtacag gccctggatc tcaacaacgt aagcgtggcc gacgacataa 44280 

cgcgcatggg tgccctcata acctatctac gaagtctcag ttctatattt aatctgggcc 44340 

gcagatttca cgtttacgcg ttctcatcgg acacgaatac cgcttcctgt tggtgtgcat 44400 

ataactagaa acgggcctcc ctgtgctttg acatgtcgat ccccaaaatt atgacggtgt 44460 

ccagagacaa cgagggtacg gtgtgtgaag tcgcggtgga caacggacga cacagagcga 44520 

tgatttatta ccctaagacc accaacttag caaacgagcg cgcggacgtt gttaaggaag 44580 

cttttgatac cgaaacccca gtggacattg taaagcaaat tgttaacgag ggcctagcta 44640 

tatccaaaaa aaattgcgtc cgtttggcgt tgtatttata tttttatttg cagtacgtgt 44700 

gctttgctct gctcctcact tggcagttaa acccgtacat ggacccaccg ggtctggtgt 44760 

ttgcggttaa ccccatgggt ccaaaacatg tcacgaaact accgcacccg gctattgttg 44820 

cggtaggttg tggggcagac gccatctgta agaactgtag cgtccccgat atcaaaacgg 44880 

agcttggaat ggtttaccac aacgggtcta gcgattctgg tcagcgcgca cactatgggc 44940 

tggccctgtt aaaggcggcc tggcttgtca tgggaaatgt gtgtccggaa ccagtagtgc 45000 

ggcaaggcgc tgcattactt ggtccatgga accggacgga gtggtcggat tttaaatcgg 45060 

caatggcggc aaccacgttt tgcggatcca gaggcgttct gtggtcaccg attcatgaaa 45120 

aaaacctctg tcgccccacc tggaatgatg taattaacac atcagttttt acaaatgaat 45180 

cactctgtcc aaatatacct gtggtgcccg aaagtgtaat agtgcttaat ggtgatgcat 45240 

gagaacaata aacgtattcc cacgcacttc atgtacgttg tttttattgg tctggtaagg 45300 

tattaacagt aatgggaggt tccgctggtg cctataaagc aaaacggtca tagagtaaca 45360 

atattgtgaa tggggaaaat agtctgtaaa tagtttatcc gaacagtatg actgcacaca 45420 

cgaatggggt tttaaccacg acgggctttt caacaagtca gccggaatcg gttcaagttt 45480

ctccatttta tcgcgtaatt acaaaacctc ccgttatggg cttgtttttt tgcgtggcta 45540

tgtgcgttat cgcgttggta tggtacgtga tgcggagggt gtgttgtaag gggcgcgttg 45600

ttgccgattc gtgtcgcgac ccgcgtcaac ccgcgtatga gatgttgaat gttaggttgc 45660

gtccccacgg aaccaatcca tagagaactt tcacgtacat agccaaagcg cacgacgtgc 45720

tctctagctc gtttattcta gcgatttaaa aacaaattta tcgttggttg ccatgtagtg 45780

ggccatgacg gcggcgacaa gcacgtcgtc cgacatggtt ttttgtttcg cgcagtacgt 45840

gtgacctccg tcttttagtg gaatgcaacg tattgccttg atttgatcta ttagatacgc 45900

tacgggatca aacgatagct taatagtgtg agacaccaca gtttggctag cgctaaacgt 45960

tcccgagttc aatgcgtaaa taaaagattc aaacgccttg gctttctctg gccccaacat 46020

gtaaatgggc gaacgtataa gagtgttctt gtccacgtgg tgtaaaaaac taagagggac 46080

cgagcaaatc tcgtttaaca cggttgctat ggccactccg gcatcttgac tgctgttgcc 46140

ttccacggcg acgttaacgt gcgtgatctg cgggtgaagg gtaacgatcg ctcgaattaa 46200

cgcggcggca catgacgcta tctggtacgc ggcggtgccg gttagatctc ttagaaaaaa 46260

atgttccacg cctaataaaa tgcatttaat caccttgtga tttaccgcaa tcaccgcacc 46320

gatgccggtg ccggacgcgt ctgtgttgtt ggtgtacgcc gggtctatgt aaacgtggag 46380

acacggcttc atcgcgccct gtatatcttg tgacgtgctc tttacgcggc acagatccaa 46440

ttgtgacagc gaagagtcgc taacgatttt atgcatgctt tgagcggacg tggcagcgtc 46500

ccccattagc tccgtggaga atgcgccttc caaaaaaaga ttagtcgtgc tccgtaccgt 46560

ctcatcaatc gtaatatagg caggaatatg caggcggtag cacgggcacg ccacgacagt 46620

gtcttgcaag ttaaaatcat ctttatgatc cggacatacg tagttcacca cgttaagcat 46680

cttttcgtgg gcgtccttca aattaagtaa aaaactagtt gatttgtcag atgagttact 46740

tgaggatata aaaatcagtt ttgcgtcctt ttgaagcata aaacccagaa tcgcggggag 46800

ggcatctttt ttaatgaagt tcgcctcgtc gatatatagg aggtggaacg tctgtccccg 46860

gatgctctaa aggcaaagaa aaacacaaaa cgagttacgc gggtatggag aacgatacgc 46920

ctaaggacaa aatctcggaa gctgactttc aacagtgtca ggcgttcttt caccgtccca 46980

ttagagatct aatttcatct ggagctgacg ctttaaacca ctttagccta tctgaatcag 47040

acggacataa attggaacgg attgttcttc tgcttgacct ggtggggaca gaatgtctct 47100

cttataccac gatcgctgca aagaatgtca aatgacgcgc gtcaacagcc caatatgtcg 47160

atttcataac gtctctaact tataccagtg tttggattgt aagcgctatc acgtatgcga 47220

cgggggacgc aactgcgtga tcgtgtacac tcgcgaaaat ctagtgtgtg atttaacggg 47280

aaactgcgtt ttggataatg tgcaggacgt atgttcgtac ggtcctccag aacgccgcgt 47340

acccgacgcc ttcatcgatc cgctcgtgtc acacggcacg agggaatgtc ttaaaagcga 47400

tatactgagg tactttgaga cggtcggtgt gaaatctgag gcatattcta ccgttgtcaa 47460

gaatggacaa ttgaatggca tcataggtag attaatagac gctacgttta acgagtgcct 47520

tccggtaatg agcgacggcg aaggtggcag agacctcgcg gcgagcattt acatccacat 47580

aattatctcc atatactcca ctaaaacggt atatgataat cttctattta aatgtacgag 47640

aaataaaaaa tacgaccaca ttgtaaaaac tatcagagcg caatggatgc gcatggtctc 47700

aaccggcgat ccgtcgcggg tcagtgcgac gggttgtttc acgtgatact tccgcgaggg 47760

tttatcctcg cgaacaatat tacgtgcggt gaacggcagc ggttttttgc acacacttgg 47820

ttcgctgcat ctggacgcac gtctaagact ttatacgtgt ggggacgggt atttcaaaac 47880

accgacccgg gccgcgggga cggtccgtcc gggccgtggt ccggactggc gattagtctg 47940

cctctgttta ccacaaatgg aaaatttcat ccgtttgatg tagttatact caaggccgat 48000

acgcctgact ctggaagctc gtggaccgtg aagttcttgt atatgtcatt aattgcggct 48060

tacagaaacg caatgcgagg tttaaaagat aaagtttcgc aatgtaccga tgccgccgtt 48120

gacggtgagg ttcatcctct aaccgtctta aaagaagcgt tggtatcacc ggacactgct 48180

acgcgacccg tgtccgcgtg caaccctcta cagatgttga ccggactctt acagtctagg 48240

gtacgggacg actacgtgac acaccaccgt gcgctcgaac gcccaggtaa tgtgagggga 48300

caagtaatcg ccccgacgcg caccgagatg ccaaacggat cgccaagtcg tgtaaggctt 48360

ggattccgcc ctcccaaaca agccaactat ccaaagacgt gggcgcaggc gcgtcacgtt 48420

ttctcgtctc gcgcatatta cgtgtgcgta tatgataacg aagaactaga taccaagtgg 48480

cagcggcaag atccgcggcc gttgccgcta gattggtccg atccggtcgc gtacctgtta 48540

gagggcgatt tgtttttagg agccaagcag aatgcgtttg tcgattctat agaaaagacg 48600

tgcaggtgtc agaactatac cattaagcaa ttttttccgg ttttgataaa tagggacaac 48660

gaaacagtcg acttaattaa ggagcatttt atagaggcgt gcttcgtgat tagaaaccag 48720

gtgtcagaga ggagcgcttg ggtaaaggcg gcgctgtttc gcaacgatag taacacgtat 48780

tggaaggatg ttttgggatt atgggagcat gggcctcata agctgggtac ggctataaaa 48840

ctaccaacat cggaaccttg caatgccgac gtaaactgga gctggctcct gtgtgacgag 48900

gatataactc ggtcaattag cgggcagtct actgtttgct tagttgtctc gcctaccctg 48960

accgcctggc tggtgctccc ggggggcttt gttattaaag gccgctacga cctatcaagc 49020

gaggatttaa tgtttgtggc ttcgagatat ggccacccag cgtcgtcaca ttcttaaatc 49080

gtttttaaac aaggaatgca tatggttgcg acacccgggt acgtccgcgt ttgttcgggt 49140

atacaccgcg accactgcgc attctgccgt ttttgacccg ccggtaacta gcgaaaatgc 49200

gatgtcactt aactttttaa atgttatgat cgtaatcatg aaaccaaaag aatttggccc 49260

gtgcgttacc gtgtacatga atggagatat tctagatttt tgtgccacgg aatctgtcgc 49320

cataagggac gtgcctggta gggcggacct gtgtttaatt cgttttggta ccctttctaa 49380

tgcgccgagg agcgttccga tacccgggcc gttgaaccca catccgcgag aaaccgtgcc 49440

cgggctaaca aaacaggaaa ttatatacac ttcgcaaaca gtgccaagag gacagatacc 49500

agatgccata aaggggaaag agttccacca aataaatccg tttttgtggt ttgacggagg 49560

ggcgttttgg caactgttcc tctctgtgga ttttatgctg ctctgtcccg cactcgacac 49620

agttccgtcc ctggccagaa tcgttgggct tcttacacag tgcgataaga gcacgtgtaa 49680

aatttgtacg ggggcccacg tacacgttaa cccgtatcgc ggatacacgc cacctgactc 49740

gcaagggacc tcaccctcgt gcccctgcct tatctcgtgc ggggccaggc gcgcggcgga 49800 

tgtcctggtt accggacacg ttaatctttt gggcctgctc tttgacccca aagcctcccc 49860 

caaagtgacc aagctgcgtt taaaaagaaa cccacgcccg gtaccgatag aggacgccat 49920 

gtcgggcgtc acggccgaag ggaccgaggt gcaacccact tcgctaccgt gggccctcat 49980 

tcgcctgccg gatttagcca gtcgcgtgat gctatacggc tgccagaact taaaaagcat 50040 

ctgcttacgt tcttattgaa gcacgtcgca cacatcagcg aactcggccg ccgtccgggg 50100 

ctcgcgtaca cgatggttcc gttttccttc tttatgttca ggttcttcgc cggaaaccac 50160 

ctggatagcg tggcgatgac ctcggagaaa acggcattgg cgacgtgttt ttggtgggcc 50220 

acgtaaccca cgtgaacgtt ttcaaccgac gagagaagaa cgctgataat ggcgaccacg 50280 

atccatgttt tcccgtgacg acgcgggata acaaaaacgc tggctttctg cttaaacgtc 50340 

tgtaacacgt cgtccgtggt ttcaaataaa ccaaagtgtt gcttaaacgt ggcaaataac 50400 

tggtgtgtct tttcgggtgc tttgatagac gcgataaaat aaaacgtgtg catgattaat 50460 

tgttgttgaa agggctcaag gcaacaaacc ccgggcacgt aactcccatt taaaaatgac 50520 

gacaggttgg ctaaaaactg ccgcagttct gcataaacgg ggcactctag gaaagcttcg 50580 

tgggtccgtt gggccgattg gtattccatg tactggtcct tggcgtttgc catttggacc 50640 

cgacaacatg tttgtttaag gtagtttgtg agttcctcgg ttaaccgggg caacgtagcg 50700 

tggctagaaa ttcggggctt tcccaacgta ggggccggca tgggttcttg aactctgcgg 50760 

tacgtttgaa gataatcgtc taaaacagaa ctatatgcat taaccgcgtg aataactccc 50820 

aagcatgggt gggccattcg ttcggtcttg tcgttgcccg atataataac gggcggttgc 50880 

cgaaaccagt tttcgcaacc accgtccgtg accacgcgca agttattttg aagacgttcg 50940 

cgatagctgg ttaacaacat gttcccttca agttttttga ataacgggca ccccgaaacg 51000 

gaacgccgct tcgttaaagg cgttcagtta gctttagacc tgtgtgacaa cactccggga 51060 

cagtttaaac tagttgaaac acctcttaat agttttcttc tggtatccaa cgttctgccg 51120 

gaatcgcgcc cggttagaga ctgtccgcag ccggaagggt ttgactttga acacattcac 51180 

ctcccaaaac taacacgcat gcagcgtgtc ctggggcgat actgcgacca tgttaacaac 51240 

gacgacacgt gcgttaacgt aaaggcaagt tcctcgaatt cacagggtgc cttgttttat 51300 

ctgccgtatg gacaggacga atggaattgg gcgctcacgt taaggaaaga caagttggtt 51360 

aaaatggctg tagagggctt gtcaaatccc acgacctgga aaggtttaga gcccgtggat 51420 

cctttaccgc tcatatggct tctgttttac ggttcccggt cgttctgtcg ggaaccagag 51480 

tgcctatatg aacgcaattt tggtatgaag ggacccatac tcttaccgcc acatatgtat 51540 

gccccccaaa aggacgtaat gacttttgtc catcatgtaa ttaagtacgt taaattttta 51600 

tacgtgaacg ccggtggggg tcttgaaact gaaccgtccc cgccgttcga ggcctcgcgg 51660 

ttgcgcgcag ccatcgctcg tctcggggac gtggaagcgg atgacgcata cctgtccgca 51720 

aagtgcatgt tgtgtcacct gtacaagcaa aacgatacga tttcgattca tgaaacacac 51780

tcatcgcatt aggcggagac ggtgcgagat atataacgtc tagtgttcgg ST25 51840

gctcaacggt gcacgagtcg gggagatttt gttttaatcc ccctgrtacaa cattgaaggg 51900

ctcgtaagca tgataaggga acatggcctc ggcagcagct aaaaaaatgt taattaagtc 51960

ggagctcgag tcggaaatca acaaaaaact gtccatctcc gtatttgaca ggtttggggc 52020

cgacagtgcc gtgtttaacg cgcagtataa gggaaccagg gaatcgctgc ggtcgtacaa 52080

cagcctaaaa aagaaggacg atctggcgac cgttgtcgga acgctagaaa cgtcgctgcg 52140

tgaaaaacaa agcgaattgg gattactaaa ggggtttaac aggaaaaaaa ttgaagagtt 52200

tgacgctgtg gcggacgcgg ttcgcgacct caaggacgag ctgtacggag aactggagat 52260

tctaggtacg cttgacaatg aatctgttcc cgtggaagaa gagtccccca aggacgacat 52320

tattaggtgg aaattggagc gtctgcccag agtgtgcccc aaaagccctt gatcccattc 52380

ccaaggttca gactgacgtc gacagaacag catcgtccca tataaccgtc attaaaacac 52440

gtaagacgat cgcccaactg aagataccta acaactgggg ccagtgtagt caccaggcga 52500

cggactggac cgccgtgctc ggacgcggct cgtatggtgt ggtgaggtcc atgtctctcg 52560

gccgctgcgt taagcatttt ggcagccggc gtgagttttt ttacgagtgc atttttaacg 52620

atatagtacg cgcctgccgg gagaaacatc ccctgaaccg cgggggtgac cgtatactat 52680

gtttcctaga gccgtgcgta ccatgtcgcg ccctgatatt cccgcagtta acagggaatc 52740

tgctaaacgc ggatcttaaa cacgtgaacc ctgaacggct ggccgttgaa ttctctgagc 52800

tgagggaagg cgttagtttt ctaaataata tatgtggcat cgttcactgt gacatcagtc 52860

cagaaaatat actgataaag ggggaactga caactgcgta cgggagactt atgatcggag 52920

atctagggtc cgcctcttta cacacgggaa ccccttggac cggagtgatg gtgacctcca 52980

aactcgggtt cgtgcagcac acgtaccatt ttaaggcacc ggccagattt atctgtaagc 53040

acatttaccg gccgtcgtgt ctcctctacc ggtgtctgct gtcgtgcgcc gggggcccgc 53100

aggcgcatat gctaaatcag ccgttccaaa tcactccaca actcggtctc acaattgaca 53160

tatcgtccct gggttatagt ttgctagcat gcctagagaa atatcttcag ccagctgacc 53220

catttcccca gcagggagcg ttggcggacg cttcctccga atccgcccac ccattgttct 53280

atttgcgttg catggtgcca agagtagtca tcgccgagat tttttctgtt gcctgggacg 53340

ttccactcga tttaggcatt gactcatctg gccacgcgcc agctattccc ctgagagaag 53400

cgtacaggcg gttttttgcc aaccagtgta gtttatatag ggcgcaatac aaagaggatg 53460

cgttagaaaa cgcatcctcg cggctgtgta actcaaaact taaactagtt ctccagaagc 53520

tgttggtcag ggactacttt agtcattgcg gaaactgcgg agatcatgga ttttttctca 53580

gaugagccgd tggttcagga gatggcgctt ctcgacatcg atgagcagca gcggctcctc 53640

tcgaaaatga gcctggccaa ctttttaaaa catgagcgag taagggcgtt ttttagcgat 53700

aacaaaaagg aaataagcat gccggctata cggttcgtgt ataattttta tctattcgcc 53760

aaggtgggag attttatcgg caacaccgac gtgtacgatt tttacgtcac ttgcgtgttc 53820

agggggaggc gcctgacgcg cctgtcggaa gtgtacgacg cgtgcctaaa catgcacccg 53880

cacgatcgac accacgtgtg tgcattgata gaacaggtca cgcgcggcca aaacatcaat 53940

cctctgtggg acgctctgag ggacggcata atttcgtcgt caaaatttca ctgggccata 54000

aaacaacaga attcgtccaa aaaaattttt aacccgtggc ctatagtcaa caatcacttt 54060

gtagcgggcc cgctcgcgtt tggactgcgt tgcgaggaag tggttaaaaa gatactggcg 54120

acgttgctgc atccaggcga ggcgcactgt gaaaactacg gattcatgca gagtcctctc 54180

aacggggttt ttggcgtctc cttggatttt ggaattaacg tcaggtctga cccaaaagac 54240

ggtttggagt ttcacccaga ctgcaaaatc tatgaaataa aatgccggtt taagtacact 54300

ttttccaaga tggagtgtga cccgatttac gctgcgtatg ctaaacttta tcagaagccc 54360

agcatgcaga cgcttaaggg gtttttgtac tccatatcta aaccggcgat cgagtttgtc 54420

ggagaggaca ggctccccag tgaatcggac tatcttgtgg catatgacaa agaatgggag 54480

gtgtgtccgc ggaaaaagag acgcttaact gcagtacacc atctagttaa aaagtgcatg 54540

attcacaact ctacggcgcc ttctgatgtg tatatattgt cagatccgca ggaaaccgga 54600

ggccaaatta atattaaagc tcatctgagc gccaacctat ttataaacgt caggcatccg 54660

tattattatc aagtgttgct ccagtctctc gttgtacagg agtacatcag tctctccaag 54720

ggaactaaaa atttgggaac ccagaaaaac tttatagcga ctggcttttt tagaaaacgg 54780

cagtttcaag acccgagctg ctgcacgatc ggtgaatttg ccccgttgga tccacacgta 54840

gagataccga cccttttaat cgtgacaccg gtgtattttc ccagcgtggc caaacaccaa 54900

ctggtgaagc aggcgaccga attctgggcg gctagtgctc gtgaggcatt tcccgagcta 54960

ccatgggatt tatcctctct gtgtgcaaac gccccaccaa caccgtagat gtgaaggggg 55020

agcccataga tgtatccaaa gaattcgatc ctattatagg agaagaaagc attgtcttgt 55080

taacggcaga tgggactgcc cccgcggcgc tgtacaaacc caaaaccaag ccatccaaac 55140

ataaaaacaa taaattgtca gattttgttt aagcattcgt atctttattg gagtggggtg 55200

gatggtgtgg ggttgggaag ggaatgggat tgaggggagg atgaaatgct agaatcatat 55260

gtattttgga tatgcatcct cgtaatcgct cacgtcttct tctgtttcat ataacgcgtt 55320

cagttccggg gagtacttaa cttttgtcgt cttctcttta gggccttgag gaaggggtgc 55380

gtattcgggc gactttgcgc accatttcca atcgtttctg ataattctta agatgatggc 55440

taggatgcaa gtgatcggaa tcatagagat gttcacggct atgactttgt gcaggctggc 55500

agatatgaat accttgtcgt atcgaacgac gggaagaagc agaatcaggt aactgacaaa 55560

aacgccaacg tagaatccga tctggtgctt taggtacttt actagaaata gttctgtgtt 55620

gatataccaa gctattgtaa ggacaaaaaa caagtttatt gaaccaaaga caatatcgga 55680

aacgagcata tagaaactgt ttccaatagc catcatcgtt ccgagggaga atactaacat 55740

ctccatggct attaaggaca gataaagatt ggcgccaatc ggttttccgt aacgcaacag 55800

ggtgtctagc aagctattct caggaatctg ctgctctaag acgcgtagag accaggtggc 55860

cgaattgcac gatatcatag cctggacgtg cgtaaacgat aggcaaaaat gtatacagta 55920

tacaaacgag gctaaaagta tatgtttgta cgacagaacg tgaataaata actggatggt 55980

ccacagcctc aaaatagaca taaaaactac actgtgggag cctatgagca cgatccatgt 56040

ctggaggctg gtcattgttg tcgcgtgcac gcgcttggcc ttgaatatgg ccaggatggc 56100

ccacgcatag taacacagca gatatccgtc caccaaaaac gcccacgaga tgtagacaaa 56160

catttctggt gcttccaaaa acaaagtcgg cgtcagatgt ttagcagaac ttctgaccgt 56220

taggtttaac gcgctgtagt tgaccaacgt gttaaagtag cacggaaatc caagtcccgg 56280

gaatgtggcg gtcagtggca ctaccgctga cattataaac ataagtccaa gaataaccag 56340

tagttttacc catgacgata agataaacga gtcgctccgt gaaattttca tagttcaaag 56400

aatctgctcg cgggtgcgtt cggcaaacta gcagtaaaat cagcaagtgg cttacatgtt 56460

acggatttaa ataggctccg cttttaaata tcacattctg tgcaaaggag ttgagccacg 56520

gcaccatgaa cgcccgggag gtggcactca cgggacatgt tttgcacata tcgctgcata 56580

gcacgcacga gcgcgagaaa ttaataatct ggcaggttca tttacttgta tgtcaacaat 56640

gcggaattca gggagatgcc gcatatctat ttgtcaccga aacattaagc aatactgact 56700

ggggaaatat accggcgata aaccgtcacg caccgtccat aaatgagcac ggccgtaatt 56760

atatgcagtg ggaactccgt actcgtttac ggaatcccat cattcaattg ttaagtcgcc 56820

agcccggtgc ggttaacgta agggtcagcg agccgaatat ggtaatagtg ggctgcgaac 56880

gagcgttgga tcactcgtgt tcggtgcgcg tgactggagc ctatcttcat tgcgatacca 56940

ctatggactt tagtttggat tctgttgtgt ccccaacccg cgaattttgg ttctcagaga 57000

tgttttctca ctgtttagtt tccaacattg aagtttacct taaaacaacg ggcgggttat 57060

actatagggc atcgagtgcc acgcaatgcc gaaaaagggc gaaagatggc gcattgggta 57120

ttcttgatat ctttaattgc gaatctcgtg aaatacaagt tgccgggcag aagtacacct 57180

tgagtatcgc caccgcaaca tttcacgttc tctgggtgga cgaggcgtgt atgtggaacg 57240

gggccctggc cgaatttttt agggcgctgc acaataagtt gttcggcgac cgggaaggcg 57300

tagcgccaac gttaacgtac gtgtgtccgg gggccactcc ggagggaacc cccttccccc 57360

cctacttttc cgcgtttcca cacctcccgc tcgtgtttgg aagaccgcga aggctcgacg 57420

taaccgcggt ccaagaactc ccaaaagcac aaattgcggt acactggccc ccgtttaaag 57480

attcaatctt aggggatcag cttctcatac ctggcatttc acctaaaaag ccaggtaccg 57540

tacccgttcg ttggccgctt tgggtggagg atgttaactt gagtctctgc gagacgacag 57600

aaagcgttgc ccgcatagtc gacccacatt ctatagtaat cataaaattt tcatcactgt 57660

tgtgccagca cctaaaatgc caccgtgcgt ttgttaaaaa tgagttagaa tacatagcaa 57720

ccatctgttc cagcgacctt cgcctcttca tccaagagga atacaaccgg ttacttgcca 57780

ccatttttac gtgggccgcg gcgagcgggt atacctgggc ggccattgat aaaacaacag 57840

tattcatcaa ggctccccag ctcagcgcag ctgtaagtgg tttctgccca tcactaaata 57900

gttgccgtag gaaacaatgt tacgaaggtt aaaaataaca gttcatttcc tttcacagga 57960

acagcaaaag gtcgtgaccc gtcttgaggc gcatttggga cttcccgtac aggaaacttc 58020

ccacccgcct gactggctca agtgtgaggt ctgctccgcg tccgtgtttt taaaaatacc 58080

agccggggtt ttgtatgccg gactcgcaag agaccccacc agggaagcaa aacgggactc 58140

gtggctggac tgtctagtag aaggcgcgac gttgttgctt aacaactcag tgttaccgat 58200

tggggcgctg gcgggtatct tacccaccct ttttgccaac aggcggtgtg ttaatttttg 58260

gctgctgcca cgcgcgtggg taaaatcggc gcccatatgc cctcccctac cgattgactg 58320

tgttacgcct ccacagtttg tcgtgacaaa gcgtggacca atctgctggt acaaggaatg 58380

gccgttaccg gttgacgttg attttatgta ctacctacag gaggcactat gtgtttttag 58440

tgttgtgtcc aacggggagg gtacggagag tcacgcggac aatatacgac aattagagaa 58500

gtttgaaaag gtactatgtt tattttaaaa cagagtctgt ctggggatgc tgtgtaggct 58560

ggttatttga aaatatttta ataccgatgc gttatgaagg tgtctggtaa aaattgtgcc 58620

gcaataaaat atgttaaaag tctccgcgtg gccctcttct tctccaacgt tctctgttag 58680

acgaaaggat tccaaataaa tgggtttcag taagcctttc atgtcgtctg cgttttcgtc 58740

caacagctta tggagtttga accgcactag gggcggaacg cgcaaaatgc cttggggtgg 58800

gtttaactcg tggtatgccg ccaaggtcga cgctatcact tcgatcccat acaatacaaa 58860

cgcggaatca ttacacgtat acctctcgtg taggtacctg gcccggtcaa gcgcggacag 58920

gataaatgat agggggggct tatcgtagtt ttccagcata gaaagaatgc agcagagttc 58980

tgttacgctc gcgctcgccg gcttaaacag ctgcacgcgg tcagaaaaaa cgtgcggata 59040

tatgtccagt gcgtcgatca tgtcctcgtg ttcgaccacg cacgcaaaga caaacccagt 59100

aaacgttgcc gactgaagcc tggtgcgcac gtgcacgcct aaatcaggaa cgtttagtcg 59160

tcctcgaaca cacagcttag gcgtattgga agccatgcaa atgctatttt ttttatgtac 59220

ttccaacatc agtcgcggaa aaactggata attatagggc cctctggacg agcattctgt 59280

ttcaggcatt accgcctcag atcggtgctg ctcccccatt agacgcttca gtatttggtc 59340

catcggttcc aggctggcta atttgttcgc aatatttgtg gcctatgtca tttatataca 59400


cagacagcct 


gctgtccaca 


tacagctggt 


ctgcaatatc 


gtgtaaactt 


aaggaaatga 


59460 


gagagtgatt 


tatgaggtcc 


.atgtgagtca 


gtttggcgat 


tatgtacgta 


aacgggccta 


59520 


agagaatgga 


aaccgtgtcc 


tgcgagtagg 


cgatgctcgt 


tcgctgcccc 


tgattgtcga 


59580 


ccacgggtgt 


gaggttgtac 


gtttgaaaca 


tttccccttc 


ccacaggcta 


gttagctctt 


59640 


tgagcatctc 


cataaacgga 


gggacgtatt 


gtgaaaaaaa 


gctgtttact 


acggagctgc 


59700 


ctttagggat 


gtgacattcg 


gataggttaa 


tggttgggtc 


gcgcagaccg 


gataaagcct 


59760 


gcaacgtatt 


ctgagtgagt 


atattaatat 


cgtgtctggg 


cctgcctccc 


tctggtaaac 


59820 


tctgaagcaa 


ctgcgtttca 


atgcattgaa 


tcttctttac 


gtagtcgtct 


cgttctttct 


59880 


caaggttaga 


aattgtatcg 


aactgctcgt 


tgatctggtt 


cgtcaggcat 


ttgaacacgg 


59940 


tatttgaaac 


ctgtttccgg 


agaccctggg 


tcgtactttc 


cgtaccggtc 


ccaaagagct 


60000 


tgtttttgtc 


cacaatgttt 


tgagagacgt 


ctgtgacgaa 


atcctcaaca 


acgtctgtta 


60060 


t'gccgctcac 


tgttttgttc 


tccgcgagct 


ttatcagcaa 


gtttaggagc 


tccctcttgg 


60120 


ggtcggtttt 


ttgagacgat 


tgctctaagc 


gagccaatat 


gtctttatac 


atgtcgttgc 


60180 
acgtgctacc gaccagcgct tttagaggcg cggtatttag taactgacat aactttgcgt 60240

gctccgcggt cctgtgacat gtcataatcg cggtaaataa ccggtacaga gggctatcaa 60300

aggcgacgcg tccattgtaa attataggcc gtgaaacaac ttcaactgtt atatcttttt 60360

gtctgtaggt gccgagataa gcagccttat ctccgttaaa ctccacagac acctcggcgt 60420

agtctggaat ataaatagac gtcacatcgg tcattatcga ggtaatctcc tgtgccagtt 60480

tgtcgacaat ccgcgatatc gtaacgtctc cggaagcgta cgttttcctc atctgttgac 60540

agtgtgcttg tatcctggcg atcgtgtcgc cgtcgggggt gcgcctgaca ataggcacca 60600

acccaagggc gaccacccag tcgacgtatc gttcgtacga taaaacagtc ctggtgtgta 60660

gtacctgatt aatagtgttc agtaacagtt gatcgactgt tagacgaacg gtacgtgccc 60720

acgattcaaa cgtagaccgt ttgaattccg gcgtcaggtg ctcgacaccg cgccaacggg 60780

tcttcagggt ggtctcgaag ttggtccagt cctttaagac gttgtcgtat gtgcagctcc 60840

ccaatgccgt cttgtacagg tgtatgaaca gctgtcttcc aaagatccct ggatttcgaa 60900

gactggagtg tagcgtttga ccgcgaacgt acgcgtactt gccctgtagg atctcgaaaa 60960

gcgagatgga gagttccgtc ggatggacca aacacgtcga accaaatccg gggttcatct 61020

tgaacatgac gtccgatgcc aaagtcaggg gtgtcgtgga tcacgtcagt cgcctgtcaa 61080

atataactac cagcccaccg gaaatgggtt ggtacgacct ggccttcgat ccggctgaag 61140

actccgggcc gttcttgccg tttaccgttt atctaattac gggaactgct ggtgctggga 61200

aaagtaccag catatcggcc ctgtaccaaa atttaaactg cctgatcacg ggcgcgacca 61260

ccatagccgc acagaaccta tcgcgtcgcc taaagacgtt ctgtcccacg atcttcagcg 61320

cttttggctt taagagccga cacatcaata tagccgtcag aaaagctcat cagaccggag 61380

ccgtatccat agagcaaatt cagcaacagg agctatcgaa gtattggccg gttatagtgg 61440

acattatgaa agaggttatg gcgaaaaaac ccaatggcat gtacgggact atatccaacg 61500

cgaattttga aaccctctcg agaatgaccg gaccgtgttt atggacttcc aatattattg 61560

taatcgacga ggccggaacc ctgtcctctt acatactcac caccgtcgtg ttcttttact 61620

ggttcctaaa cagctggcta aatacccctc tttaccgcca gggggcggtt ccgtgcatag 61680

tatgcgtcgg ttcgccaacg cagacaaacg cgttccagtc aacttacaac cacgggacgc 61740

aaaagacgga gatatcgtcg tgcgagaaca tcctaacatt catgataggg aagaaggtcg 61800

tatctgagta cgtacacttg gagagaaact gggcgctgtt tataaacaac aagcgctgca 61860

ccgatctgca gttcggacac cttctaaaaa ttttagagta taatcttccc attcctgacg 61920

aagtcatgag ttacgtagac agatttgtcg ttcctaaaag taagattatg gatcctttag 61980

aatacattgg ctggacccga ctctttttgt cacatagcga ggtaaaggcg tatctaacaa 62040

acctacacac atgtctaacg ctagggggcg ataccaggga cacgaagctc tttacctgtc 62100

ccgtggtgtg cgaggtgttt gtgaagccgt ttgaggaata caagcgggcc gtcaacctca 62160

ccaacctcac cgtgaccgag tgggtgacaa aaaacctctt taagttaagt aattattcgc 62220

agtttgtgga ccaggacatg tccatagtcg ccacggaatc cactgaacgt tcgacccagg 62280
ttacctttat caccaagttt gttaaaaaca gccacgtgtc tttaaatgga aaaacaaaaa 62340

aatgcatatg tgggtttcag ggtacgtact ttgagttcaa aagaatccta gacagcgaac 62400

tcttcgtgga gacgcattcg caagaccgtc cggaatacgt atacggtttc ttaaacacgc 62460

tactgtacaa cgccatgtac tcgtttcacg cgtacggcgt gactaggtca cacgagaaat 62520

atctgcaaga cctaaaattt gcacccctcc cggccgctct ggcaaccggg cgtgtagacc 62580

ttcaaacggt tcgtgaagag ttaaacctgg aagacgacat cttctaccac gtgtgcagtc 62640

ccccgccacc cgcgggtatc acctccctcc aggttttggt cgacacgtac tgcgccctaa 62700

aggacgtgtt cgcctccaga ataaaggtgg cgtgtcgctg gtttggcggg gagtttgaga 62760

aggaaacgtt ttccgcgttt acggttaaca tggtcgtaag ggacggagtt gactttgtct 62820

ccccttcaga acgtctcaac gggctgttgg cgtttgcatc gaccgttgaa tcgtataaaa 62880

ttaaggggta cacgttttta ccggtagcgt tcggtcgctg tcagggtttg ccactcagtg 62940

atgacctcag gaagaagatg ccctccctgg tcgtgcagga ctctagcggt tttatcgcgt 63000

gcctagagaa taacataacc aaattgaccg aaaccatgga ggacgggagc gttttccaag 63060

tgtgctgtgc gggggactat ggggtcagct caaatttagc catgaccatc gtaaaggcac 63120

agggaatgtc gttggagcga gtagccgtag tatttgggtc ccacaagaac gtccagacaa 63180

gccacgtgta tgtagcaata tcaagggctg ttaactcaaa ttatttggtc atggacagca 63240

acccccttaa aaccctcctc agagaaccag tcgataacac ctccgccaag catatagtcc 63300

gcgccctcca caacccaaac acaaccctca tctactaaat aaataaaaac aagcaaatgg 63360

aaaacattgt gtttttattc agtccaacca cggccacgga tagttgtcat ttccacacac 63420

cgggggggtg gccgccatag tttgacgacc agaaacgccc gggccctctg ggttagaagt 63480

agccccgccc cccgtggtcg aggtcatacg tctcttaggg ccctgggttg taccaacata 63540

aatcacgtcg tcatcaccgt tgcagtccct gcgtgacgcg cgcctcttgc aggcggcacg 63600

aacaccgccg cttgtaacgt gatctgcatc gtcgtcgctg tctgaagatg aacacaggtc 63660

aatcacatca gtggcgcccg tgccctcatc ggcgtctgat tcgtcccatg tctcaacaga 63720

gcctcggtcg tccacttcgt cgtcatctga aataataatg gtctcttgtc ctgcagttga 63780

tccagcaaca acaggttctt cttcatcgga agaccatccg ccctctgccg gtgatctaga 63840

tatatcacct gaccaggcgg gcgaggcccg tggcgaaaag ccgtcagctt ctccgccact 63900

ttcatcagtg tcctcgtagt catctataaa aaaatcgtct tcgctgtctg attcagatga 63960

ctggggtgcg cggtgttggc cagtaaccac cgcaacgggc cgcggaccac caaacgcggt 64020

cagtggcgcg ggcgcgctcg gtggtgtggt ggcacctgtc gaagtaatga cggtaggtag 64080

ccggccgctc tggttaaagg actcaccgtc ccgtgcgtgc ggatctattc taggtgggga 64140

ggagaccggc ggttcgtctt cttcatccac atcactgtag tcgctgttct ccgaatgaga 64200

gtcggtttcg tctggcgttt ttggagtagt gggaggggaa tcttggtcaa aaacatcctc 64260

aggtgcagtc atgtcaatca ctggggggcc tccgtacccg ggcggtaccg gcaaccccgt 64320

acgggacccg gcgaacaaaa aacgtcttgt tctctttttc ctaggtgccc cgggaatcgg 64380
cagcatcctg ggagtggccg gcggggtcct tgacgggtcg gataagaaca tagccatggc 64440

cgaaacgttc acctgtaaaa cgcactgcct atccccgatc agttaatatt ccagtcaacg 64500 

gcacccctcc gattctgaac tagatagtca ttcgcaagtt taaaatggtt gcagcctaga 64560 

aactgcggcc agggagaacc ggttgccgcg tgggcggtag ccaggggaga cggatggcgc 64620 

gccttgagga cgagatgtct ctgtccgttt atcagcgtcg ccttgtctat agcctttctt 64680 

ccccacagca taaaaacgca cctctgaagc ttggcagaaa ggcagcttat tatgtagctg 64740 

gtgaaccaat cccagccaag gttggcgtgt gaccccgcct tccctctttc caccgtcaga 64800 

atggtgttta aaagcagaac tccccgtttt gcccaacagt ctaagcaccc gtgagaagga 64860 

gcggtgaacc caggtacagt attggctatc tctttaaaaa tatttttgag gcttggaggt 64920 

atactgtagt ccggagcaac gctaaaggct agtccggtgg cttgaccgcg atggtaagga 64980 

tcctggccaa ggatcacgac tttaatatcc tccggttcgc aacaatacga ccaccacata 65040 

attctatcta tcggtggata aatcacggtt acgttgctca tatccataac gcgcttcagc 65100 

agagccgcga gcttctgttt taaaaatgga gaaagattta aaaattcaag ccatgagtcg 65160 

ctaagtagca acgtttgtgt ggagggttct tctaacactt ctggtgacat tttactccaa 65220 

actattgttt taagccaacc ctccataaat ccccgcgtct ggtaattact ttacgtatac 65280 

gtgaaacttt tgtacttctg tctgagaaat ccaacagagg tggccgcgta taagcagttc 65340 

cgctttcact accgtttatg tgaaagtctt caaactcggc taaaaactca tcaagactat 65400 

attgcatata attcaacaat tgaagctctt ctctggggac actgtaattt aacttgtaca 65460 

aaatccttat aaaaaacgcc ctaaggtgaa atccatttac acatatttct gtggtagagc 65520 

gatcagcttt atatcgcaat gttgctgtac acagccctgg acataaagct aaatttgtta 65580 

aatgaaaaga agcaatctca aacggctccg gaaatataaa gtcactaagt ggtattattt 65640 

tacaacatgg atttactaca agcccggtac aacatgtaat taaaaaaaat ccacatgtaa 65700 

taaataaact taaggtatac atacttctca tgttttatac ataatatgaa cgctaatgct 65760 

gtacttatat aatatgtata gaattatgcc aatacagtca tggtaagtat attttaaggc 65820 

ataatgcaaa catcaaataa tataacacaa aatgcacgct tccggatatc cgcccagctg 65880 

ttaacccgaa aatacagaaa tgactacaca aacacacctg aaaccaattt tattctcaac 65940 

atatgcaaca atatttcagg gtaacaccat gttaataaaa tacgcagcat gcacattttt 66000 

agctaagctc ctcaaagaca atttcctctt caattgatga ctcatcaacc tcggtttccg 66060 

tgttaatgtc ggaaatatat gagtccagga taccctcgtc atcgccacaa attaattcca 66120 

gggcttgtaa aatatcatcc agtgaatcag ctgctaatgt caaggatgtt gtttctaatt 66180 

ttagcccatc caaatggtga gtgggtgaca ctttgggaca agaccccgta aaattggcac 66240 

gctccacaca aacggaagga tgcatttgat cgttcacgag gggcagcata tttattttta 66300 

taacatcagg aaccggaccc gtcaatgacg cacatatgtc cattaagatt ggagtcgtga 66360 

cactggttga attgacggtg aaaaaaaatg tattcgtgtc acatgcgtaa ctatttttta 66420 

accacacgag ggcggaataa aacgggtccc cggcgctaag gccgtgaatg cggcaggcac 66480 
ggctcattgc catttccaga tttgttaaat ctgttcgaac atacgagtct aaaatcaagt 66540

ataaccacga caagttcaaa caggaggcaa cgcggctggt ttccacccgg acatccccgc 66600

tcacaaaccc gaactctcta ggaacgcaca tcaaagcgtt cagacaaaat tccgaaaccg 66660

aaaccctagt tctgagcacc atctggttat aaactgcggt taagagacga gcggccacac 66720

actgctgctt gtgaggattc agcttaggtg gcctgcaggc ttgctgacag gcccgtagct 66780

gcctggcggc gctcgcgcac ctttgacagc cgcacgccag ttccagagcg aggcagaagc 66840

gctctttgca gtcgcgccat atctcggcca taggagattc gctcgcgtga tcttttaggt 66900

gcatatatat ctgttgggta acaaacctca ctccctgtaa aaggggaata aggtccgttc 66960

tttctatcgt tttctctgaa tctatagtta catctttaag tataaccaaa gacgctaaaa 67020

acccaggatg gtcgacacct ttaaaaaatc gcaaaagcga ccggatggct ctatccgcgt 67080

ttccatgtcg ttcaaaggta gtcacgatgg atctccagtt agactctgtc tccctgttaa 67140

cacccttaac tggaatggaa acagccatga caccgtgaac ttcctgatgt ctcctaaaac 67200

taaccccgaa acagagctaa ataccaatga ctgtcacccc taccccaagc cacgcccccg 67260

tactattaga ccagggtgag taaccacgct atcctttaaa aacccatacg tggagtttgt 67320

aaggtaaaca gctcgtgtat atttcagacg cctgtcataa aatggatacc gacgacaatc 67380

aggtaattaa acttttttta ttcaagattc aggagggcgt gtttacagtg gtgtaggtgg 67440

gagcatatct cgccaatggg aatggctgat gaatccacac ttagtgctcg gatcaggtgc 67500

tctgtgtact ttactagtgc cgatgaacct aaatttggcg tgtttaggtc gtaccaacca 67560

aaaaactgcc tggcctcgcc ttttagaagc tctatgcaag ctttaatcgt gtccaatagc 67620

ttgtctttaa aaatgcaact ctggtacatc tttatgacag tggtccaaaa aaaacaaaga 67680

tttaaaaaca cgttaaactg cgtgtcctgg tagtcctcgt atataacctc ctcaacaaga 67740

aaaaattttt taaccaaact cgccaggtac tgaaacgatg gtacggacag gtcgtgaaaa 67800

gtgtctatca tctccttcac ctcccctttg aaggttttgg ttacaccgac catgtgagat 67860

aggcaccagc tcagggggga ggtcggatcg tgagggggta acaattcgtg ggcgtggggg 67920

tactccagtt ttaggatctc cggcaggacg cgtacgagtt cttcgtcaaa tcctacgcag 67980

cccgcatcca ccaggggcag tactgagtta ataagtcccc gaattctatc gttcgtaatt 68040

ttgtggagct cctttagaca gtaaaccatg ttgctccgac actgcggtac gatgaaccgc 68100

tctccttgtc tgtgttgcat gagcacctgg agtatcttag acaggtatag ggcccgttcc 68160

actctaaggg ttgccgcttg gtagagcgga ttgcgggcat tgacggtcag tgtgttcatt 68220

tcaccgacta tcatgccgag tggcggatag tggaacttgt acatatgatt taaaagatgg 68280

tctttcccgt aatggcgact catggtcgtg ttttaacgtt cacagggtgt taaaaaactc 68340

agaaggtcct gcgtggacaa cttttgcgga ctctccgaac cgctcaggga taagctgtac 68400

cacgcggtgc ttcttataaa cgaagcatac gtcaaaaaac atgacccacg tgacctaaat 68460

aagctcacag aagaagtctg tcgatgtatt gtaatggaat gtgcctccct gggtcccatc 68520

tccgggctca tcgcggatct gaatctcttt aacctgttct gcctctaccg gggttcccgg 68580
gttaaaaccc gcggggccgc cacctgtaac gtcccgtgcg cagaatgcgc ccagggcatc 68640

gtgagaattc tcaccgagag ggccctttgt tgtaccgaaa agatgttcat agcgtctgcc 68700

tgcagcgggg tcgttatacc gccccagttg gccagagttc tgcacgacgt ctacgcggaa 68760

atgaaggcca agtgtttggg ggcgtggcgt cgtctcatat gttgcaggcg gcctatcatg 68820

gctattgccg actcggtcct cgtcacttat aataccctgg atgccgaggg aaaactggaa 68880

cttagactca aagcactgtg caaacttgtt tttcaaccca tctttcttca aagaatctta 68940

gcccctatgc agctactggc caacgggaag atggttcctg acaattattt taccatcacc 69000

ggtacggccg agaagaggcg ccctgtcgtg actggaagta ctagcgggat gacgtgtccg 69060

ggaagcagcc ttgtccccga ttccttaatc ctgccagtat gcgaaccggg gcttctcccg 69120

gcacccctgg ttgacctcag taatgtctta gaaaatccag aaatcatcct cagcgcccca 69180

cccctgagtc aatttgtcat cacaaacacg caccccagtc tgcctcagtc agtcagcatt 69240

attacgccaa cccagggcgt tgttcccggc caatgtttta tggacacgtg gaaagcggtg 69300

tcacagagca ttcaccacca ggcacagacg cctattttgg ccgccgcact aaccggttcg 69360

acatctgcgg cccctggccc gcatatcgca tgttccccag ttgccggcac gtctcggcag 69420

gtggaagggt ccgcgggcgt cgattgcggg aaaccagcat gcgttccgca gcccgcgtta 69480

ccgcccaatg tccccgccaa gaggatggaa acggtagcac agttgggaaa cgctcccgta 69540

aaaaacgtcc acatcggagg ccgcgtatac gctccactgg ttaatatacc aataatagac 69600

ttaacgtccc cgtcagggtc cggccagagt ccggccgata tcgccaacac tccagagtcc 69660

cgcatggcgg ccggctctcc gcccttcgcc gaaaccgccg caacggtccc cgctaagaga 69720

aagcagccac gcgaggacgt ggcagacaaa agactgaagg gcgacgttcg gggcgccgca 69780

acagtaaacc accctttccc gggaccgtcc gggatgcgcg ttcgcgagca gggcttattc 69840

gatttaatcg aaagctccac ggatgtaacc gcgaacgcat ctggacccaa aaacgacgac 69900

gacatgctag cggctatcct acaggacctg tatggactac agtccccccc ggccatcgat 69960

tccccctcca gcaactcgga caatgaggag atatttccag aggttagtcc gccatctagc 70020

ggccacggat cgccttgaaa gatgacccca agacgcaaga tcacaccacc gagacagtcc 70080

cacgcaaccc ggcggatgtc taaccgtaaa aatatcaccg gttggaaagt cccttggtgc 70140

gacaaaaatg agcttcaaac ttacccattt tctgtgctat ttcgggacac taaaaaattg 70200

tcgccagtgg cgacaatttt taagtctgtg taaatgtcac tctttctcaa gtttctgtcc 70260

tagaattttc acatccagtt tcatatcata taaagacacc tgttttgttt tcgtgaatct 70320

ggcgtgtttt tatcttgaga ggccccctaa aaacatgcct cgtgtgaaaa cacaacccaa 70380

gagaccccaa gtgcttgaat ttatgccatt agatctccac ggtggaacac acacggagat 70440

ggattctcaa aacctgtgtc ctgacggcca ggatctgctc gggtcttata tctatacgga 70500

gaataacggc ccgttttccc aaataatgca caatggacag agcaataccg ggacaggtga 70560

aagcttcggc agctacgctg ccggcgacgg ttttctgggc ggttctgtgt cagggatgta 70620

tggaaacaac accggagagg gcgcgtgttc taaaagaccg tccgcgtgcc gtaaacgctc 70680
ggctgcacta attcacgcgg cgtccgaggc gtctgtggcc gagcaaggca cctcacaggg 70740

ggcacatgcc gtatctgacc ggataggcag agacggtggc gctgacaata gactactcaa 70800 

ggtgagtgcg cggctgtcgg acaaaacaaa gagcgccctt cgcagccatc cttgcttgcg 70860 

ttgctattct ttgatgttta acacgtaatg tctgtttata ttaaccctgc catagcccga 70920 

accgaacact gtgcgatgcg attataaaac gcaacactcg gacacgtcag atggacgcgc 70980 

aaaacgcgcg cggcagtgca agacctcagg aaaaaataaa gcccaacagc tgcaacaggt 71040 

acaataagca aatttagtaa tttgtatttt ttaagtttcc gaggttttcc cgctggcctg 71100 

cggcaccgtg tccgtgcttt agttggaata cctctcctaa caagcacgta ttccaacagg 71160 

agctggaaag taaaacaacg caactgaccg tcctcaccgc tcaacacgaa gcccttcaaa 71220 

agcacgcatc atttttacag aagatgatga tattgatgtg caaaaacggg aacagtaaaa 71280 

aaagcacgtc atgatttaaa gccacctcgg gttactagtt taaaactgcc taattgattc 71340 

tattcaccct tcaataaaat aaaattttaa tacgtttgcg gtttgtccat ttcctgtctt 71400 

aaaactatta aacaattcat ttgggaataa ggatttatgg gattaaggga ttaaattttt 71460 

ttggatcatg ggatttggga acatacgtct gggatggagg ttatgcttca tggtctgggt 71520 

ggcgtggatt gcacggggac ggtcggtgtg cccaacctgg cacctgacag atgggaaata 71580 

cgaggcggta tacaggcact acctcgaaga gtgccgcaaa catgaaggct cggggagcct 71640 

ggacggttcc ggacagacaa aggggtctgg aaccaaagca accaccgaag ctaatatatc 71700 

gataagacct aacgttgtca catcaggtca aaataaagag ccgcctggga cagcaccgag 71760 

ggccgaatca tcacacgacc tgccacgcat caagcaggtt aacgctctcc gattatcaac 71820 

cccggaattg gcgcaaccac tcccggtagt aaaatcgact ccgcgcgagt cacagtcagg 71880 

tgggacaccc tggaacgcgc gcccccacgc gttcattatg cacacaaacg acatgctcaa 71940 

cccatctgtg gtcctgtctt tcagagccat ccgtgcgcgg tccacacgcg ataccgagca 72000 

gtccgttcgc gatcggaaca cggtcacgac cagctatcgt acccctggcc gcccttccct 72060 

ctttcaagcc agaccctcgt ctcacggtgc gcgtctaccc ccttcgcccc gaacgatggc 72120 

aagatacgcc gagtcgcgaa caatatgcga ccaaaattga ccgcaaagaa taccacgccg 72180 

tccgttcgac gcagtgcccg tatggacgct gcacgggacc tcaagcgtcc gtgtcagacc 72240 

cattcctccc ttcgccgacg ccgtggcacc tggttcaccc cctgtgggct atggccggaa 72300 

tattgggcct cacgatcgcc ttgtacacga tatatcaaat atatgcacat agaaactaca 72360 

cgacgcttca ctejcgaataa aggtttattt attttgcaca ctagtccgcg tcgttatttc 72420 

tggtcctgga ctgggcgcgc ctcctcctga gcgccccgtc cgcgctggca gcgattccgc 72480 

ctctggtggt ctcgtccatt gacacgtcca cgcggatact aattccggcg agggcgtctt 72540 

ccatctgccc gcgcgtcacg gccttggccg tggccgccct caccttagct tcaatcttac 72600 

gggttgccac gttacataag gccgaaacgg cagctgcgat tttggcctcc ttttcggtgt 72660 

tagagatgac ggggtcggag ccggacctgg tgggatcccc agaatttatc agctttttga 72720 

gctgcttgtt ttcaacggac aatttttgca cctgagcggc taactcctcc attgtcagct 72780 
tagccttggt atacagcgga tgttatttac ggaagtgcca catgaaaccg tggccgggtg 79140

cgtaacagta aaccgggtcc aatcgcgcag tttacacaac acataaaata ggcgaccgtt 79200 

tatagcggtt tatttgaaca tacaacacac atttcaggag tttcggtctt tataaaaatc 79260 

ttcattcatt caaagtgccg atatatttca cgctggtgga caacaataca tatcggcgcc 79320 

tgtgattgat cctcctcctc gtatagggac caggcagaca cccaaatctg tgcaaacgga 79380 

tcgggtggcc cgggatcaga gggtcgtggt gactgggcta actttaacat gtattcatcg 79440 

gtatcaaaaa cgcgatgtcp caccccagat ggcacgcgtc ttggttcaga ggtgtttgag 79500 

gtattgccac ggaaccataa attaagaaat ccatagccat tcatcataat cccatactga 79560 

caactgcaca aaacaatgcc gtcctccagg gcgtgtaacg cgttcacaag tctgctcaac 79620 

ataaaatcgt catccatata aggcgccggg ggcaaccaca actgaagcgg cccgggagcc 79680 

atgcagacgt ggtgttcatt tttatccgtt ggtctcgtgg ataaacgaac gcccatccca 79740 

gaatgggtaa aagtctgtgc ttgaaggacc ccataataat aaacctgaat atgcataaat 79800 

ccccacgggt cgtcaacaga tggcgccaaa cacgcagaaa gaccgctcga tgcagatagc 79860 

ggcagggcgt gttccccggg caacgcaacc gcgccccctt gactattgtc ctgcctaccg 79920 

tcttcaccgg aggcggttgc ggttgcagac gcctcggtgg ccggggcggt catggtgaag 79980 

gtaccggcgg gacgggtacg aagtagcccg agcagctgct ttaccgtgta ttcagatacc 80040 

ctgcccgcct cgtcggcgcc cttgcgctta aacatccgaa acctcaagcc ccgcagttct 80100 

gccgctaccc cagaggcccg agggcacaac ctgcataaca tctcttcatc ggagcgcaac 80160 

ctgaagacca cgtttgcaaa tgaaaaactc cgtctcaaaa cgtccttttc ttctatgaaa 80220 

taggcactct ttcttatcgc acccaatagg cgcgccttgg cctggcccag cctagacctc 80280 

ccggacggaa gggggatgtt tctctggttg cagtagtcct cgtacatctt cccatcccga 80340 

aggtaatcaa atcccacaga ccacggggtc gtagtcgcca gcacgaccct ggttttttcg 80400 

tcgtctgccc aaaacagtcc cgaatacgtt gcaagatttg cgtgagtaac gatccattct 80460 

cgaagtccca tggcggacgg accaccgctc tgaggtgggc gccaggtagc catggctgta 80520 

tatgccaggt tgatactcac cgctttcagt tcgtgttaaa ataaccaaac tccgcccatg 80580 

cggctaccgc cgtgggcgca gatgtgtgca gccgtcacaa aagaatccca cctgcggcaa 80640 

ctaattccct cagcgcccag tcagctggct accggttcat ttcactcacg agcacgccac 80700 

cccatcatct tcaccgctcg ttcccgcgtc cggatcgtcc attgaatctg gtccggagtc 80760 

gcccagctct actgccgggt catgcgattg gctcgtcccc tctcctggat cttccatctg 80820 

gaatgaacag tcggtgcgac cctctgatgt tacagtcgtc ccggggtaaa aacaacgctc 80880

tgtaccgcat acagacgagg tttcgcacac gtaacgcagt ctcatggcga tgggcgcatc 80940 

gacgggattc ggcgactgga ccgacagaca cagggcgatc tgtgcataat ctctaggtgg 81000 

ttcgtctcca gggttcgggg accttgcaag agctctaaga taatccacca tatcaaaaac 81060 

ctgaacgggt tgcctggatg gcaggtcctt atactcctcc tcattgttag taagcacctt 81120 

catttcccgg ttctccaggt tcttcacgca aatgccagta tccgtaaggg taatcaacac 81180 
acccttctcg cactttttgt gcataaccct tagatcttct ctggcaaact gacagtcaac 81240

gacccccggt gacggtaaaa acaaagtcct ctcatcttgg aaacaacacg cgtggccagc 81300 

gaggacggga catggtcgac tgcagacccg tattccagca tgaacacgtt cgaggccaac 81360 

ggtctctcca aaatatagaa acgttaattg gatggcccct ggcagccgat gctcgcgcac 81420 

acgcttgttt ttgtccatcc gactccgagc caataccgtt gcggtaaaaa cccgtcttct 81480 

ttcacgtcca gacccacccc cctggtcgta ttggaactca ctgattaaac caaggcactg 81540 

agtttcggtg gaagacatca gattacacat tgcacaactt accaccggtt ccggcaacag 81600 

gcgaaacagt ctatatcgag cacccggtcg cccataggcc ttattaataa atttaagctc 81660 

tcgtatactt ttgtggtgtc tcatgtcgta caagaatctc attttcctga atctgccata 81720 

gtgggacggt ttttcgccct gacatatgcc tcgcatgttg cagtagtcat caaaaaactt 81780 

ctcgctattc catacggacc gatctgcagt tactttgttc cacgggaccc ggacaacggt 81840 

cttgtcctca ttctcccaaa ccaatccgcg gtactcgccc gattcgaccg cggcaacgag 81900 

ccaggccctg atatcgactc cacggcccgc cattccgcgt attgggagct tgacggatca 81960 

gctctcaaga caaatgaaac ccttgcgttc acggcaccag ttcataaatg tatctaaatg 82020 

acggagtcat gtgccctaaa catatggcta agactactca aactgtgctt aggggaagag 82080 

tgacaaacac ctggcactca ataaaatttt gaggtcgggc attgacccgt gccttcctct 82140 

ggttagtatt aaccccatcg ccaccatatt caaggaacaa aacactccag atacaaaaca 82200 

ctttattaca gatagaaggc gctcaccggc ctcacaaaac cggtcgcgcg cacaggtaac 82260 

attaaacatt agcggcggac aacgcccgcc acagacactc gtgccacagc tgaataatta 82320 

gcggtacagt gctttgcact ccgccctcgg gtgtaggtac gcccccaaaa tataggtaaa 82380 

cacaagcttg tggaagcgcc gctccgtccg gagatgttct ggccatggct gtcatatagt 82440 

caaccaagtt aaatgcacga cacgcgttgg cgcacgacag gaccgcaacc gcccccgtag 82500 

attcgacgtt cccttgaacg aacgccaggg aatgccccaa ccatctaatg taaatgccgg 82560 

aggggtcggc ccaaaacgcc aacccgcgca aaagatcttt atccaggaag cgcaacgcct 82620 

cggcctgaaa ctctgctaag tggcctggaa tctccggaac cagtgcctgc tcggcgatcg 82680 

gtgcggcgca tatgtgccct tgagggcgac gttcggacaa tgggagcaca cggatgccgt 82740 

tggacgattc agtcaccact tcaccaactc tgtccccata atagtaaaac attatacgta 82800 

gcatccaaca ctgttcaccc ccggctccca ccgcaccagt gtaccgaaca ccggcgccaa 82860 

ccggaccaaa cttattcacg gcctcctcgc gcaacccccg taacatagca gtagtgccgg 82920 

aggcctgatc gcaacgagca cacgacctca cggtaattgg taaaaggcga atgattcggc 82980 

atctgtccgg atatggagtc gaaaggttgt ccttagttga cacgtccgac acggtttgac 83040 

tatgccgtat ggcgctggat agccagtttt tgcattcctt agcggttaat tccctgccag 83100 

ccgcgtgcaa gatgcccctg gatcgacagt agtcaataaa aatgtttttt tccgcctcct 83160 

cgactcctct gctccctctg tcgtgattcc acggtatgcg gattaaagtt ctttcctcgt 83220 

caacccaccg cattccggga tgacgacctg tctcgcagca ttccaccagc catcctcgaa 83280 
ggctggatgg tttggtcact cggaccggcc gctccatatc ctcgccgcgt gcacggcaat 83340

gccaaacctt gtccaagtta cctcctgaca agcacctgcg accctcacag acctacgcgt 83400 

ggcaaacggg aacttgtggt ctagtctgct gggacagtac cttagcgtta tttttatctg 83460 

tcgtgcactc ctgcggcgga agtttgtggt ttatgagcac caccgggaag gattcatgtg 83520 

tccctgcacc agctcgtcct gacagaccac cacagaaata ggaacttgtg ggctctcttg 83580 

ggcaccaggc tgctgtgaga gatacagtgt tgccccgttg cacggcgcat ggctctctgg 83640 

tcccggagag cgggcaagct ttctcatgaa acccatgaag ttaaaaattt tctgtttaga 83700 

gaggaacatg acctctcgtt ggataatatc accattacag tcgttacccg ccgcgtacag 83760 

cgtttccctg ttcctattat cgacgtacac tccggtttcg ttgccgtgta cgtagatgcc 83820 

ctcgttatac gcgaccagca tcgtttttat ttgcgttatc tgttctgcgg tcagtttttc 83880 

acgatctgga gatggaaacc agacagtgaa cccggtgcca tagaaacaca gatgaccggg 83940 

atggggaacc gggcgcggat gcaaacgaac acctctccga tcttgggacg tgtgatccag 84000 

tgccagaacg ccaaaataaa acacctgaat tctcaaccgc caaaacggca caatttgatg 84060 

aggcagcggc gcttctggga ccccgagacg atgtgctcgg gcgtacgctg atctcgtcgg 84120 

agggaccaac gtgggcaagg gggcgggcag cggcattccc ggctctaacc cgatgccaca 84180 

cacctgctcc aaataattta taatcccttc agagttctca tcaaggatac acgcatagca 84240 

tccagaatcc atcaaaggtt ttacccggaa ggcggtccag ccgccaccgt gcgtccgcac 84300 

gaccccatgc tccacttctt gaaatccagc attctccctc aaacccctca aaaatcgtct 84360 

cttacactct agcaggtttc catctagccc aacctgcaga cctctctcca cgcagtaggc 84420 

gacaacggct ggagttccag ctgccccctt cttccaagac agcttaaagc tttttttccc 84480 

tcatcgcacc aagttaggtc tgaataggtt ttctcgttga gatggtaaag ggtccacgat 84540 

cgtagatggc tgtagttaga atctcgattc gccatgacag cgtacaccta tacaataagg 84600 

cgcgatacga gtgtactttt cccctgatga tatttgcagg gctctatctc ctcaaaacgt 84660

gcacctggct aagcaagcag ctctatttct gccacttcct catttataaa accatattgt 84720 

ttctggtctc acgttttgca gctcacctgc aacgacacgg tgaatgatcc cgagtttcgc 84780 

tcctccagtg ccttgacaaa cactggccca gagttaaaaa gtgaagcaaa aggcatagct 84840 

ttcgatttca gtcaccggca gtgagtggtg gacattagaa agtgtttggc ccacacattc 84900 

agctgtgcac tcaagcccga ccaatatggc acgtgaccat tcccctatca caacactagt 84960 

acagaaaaaa cgaaaccaca cgcgaggtga cattttccag gttagagaaa tttattaagc 85020 

cggcgaatcc atctaacaaa acagcaaatg tttattcaaa gtgcctatag atttctgttt 85080 

gataaacaat aatagatagt ggagccctcg aagagtttcg ctcctcgccc agggaacaac 85140 

ccgaaaccca gagtttaaca aaggccgcgg ggggacccgt gtcgcgcggg gagggggacc 85200 

tggccatatc aagcaaataa cgatcagtgt caaatgccca cacgtgcatc ccggtgtagg 85260 

cacgatgtgg caaactgtac ggttccacgg tatttccctg aaaccacacg ttaggacccc 85320

cggtaatcgt gaacataatt cccctgctgg tgctgcagaa aattattcca tccacgagcc 85380 
atttcaaagc gtcctcgagt ctctgcacca cagacgactc agtcgtctgt ggtatcggag 85440

gaaaccacgc ctgaaggggt ccatccaaaa cacacacgtg gtgtcccgcc ttatttgttg 85500

gttttgaaga tagcctgatt ccctgcccgg tggctgtcaa cagctcagcc tgcaattgcc 85560

catagtaata aatctttata tgcatgtgac cccatggatc gccggcagat ggcgcagcac 85620

acgccataag tctggcgcgc agaaggcccg aagaatacgc tgcatcgtcg tcctgcgcgt 85680

cgtgtccatc ttgatgacct tcccgatacc gtctctcttg atcacttacg gacggtcccg 85740

aaagggcgtg aagcattccg cgcccgagca tggcaaaccg catattgcgc aactcaaggc 85800

ggagcgcgca cacgcgcgga cacaccggac aagacatctc ttcggatgaa cgcaacctaa 85860

agaccagcgc cgtgaacgga aattgatcgg tagggaattc cttatcttcc tcaaagtact 85920

tgctctttct tacggctccc agcaggcgcc ccctggcttg acacagcctg ctccttccag 85980

acggcagtgg gatgtgtctt aactcacagt aggcatcata gtgttgaccg tcgcgttgat 86040

aatcataatt gggaaacgac ggtggtgtag ctgccaacac cagcctcgtc ttttgctcgt 86100

ctgcccaaaa caggcccggg tacagaccac tatcggcttt gaacgtcagc cattccttga 86160

gggccatgcc ataacgagat gggccccttc tcgattcccc agcggccatg attcagtcgg 86220

caacgtttaa aacccatgta aagtttcagt tcaatttaaa atgacacaac tccgcccatg 86280

ctgtgacaaa cataaagaaa gtggcacacg tgtgcgcttt ggccgtcctg tcagatgaac 86340

cccgccggta acatttactt cctcccatac ggtgcgaggg tagatacaga tgtgcttccc 86400

gttgttcccg aagcaccgac acccgctcca gacggtccct cgtccccgct gtcatctccc 86460

gcggcaccgt tgctgacatc atctgacaca tcatcattgc acacgtagcg aagtttcagg 86520

gcaatgggag catcccatgt attttcggac tgagtagacg ggcacatgaa aatccacgca 86580

cagtcacgcg gcgggacatc tcctggagtt ggcgatctcg ccagctccct caaataatgc 86640

aagagatcga acgcctggag cggctgcgaa ggccgcagct catagtaatt ttcagaatag 86700

ttggtcaatg ccgtcatgtt gcgttcttcc aggtttttca cacagattcc agactccgtc 86760

ataacacaga tcaaacctct ctcgcatgtt ttctgcaggg cctgcaaatc tgccctggca 86820

aactcgcagg gaatcaactg ggacgacggc aagaaacacg tcaatggatt atcggcacaa 86880

cacaggtgtc cgggacgttt gggatcagga aggccacata tacgcacacc cgagccggct 86940

cgcaaaattt gaactccgac gttctctcca aagtaaaaaa aggatacctc aacagcccct 87000

agcacaggtt gcgcagaaga gtcctggagt cgtttggatt tccgtttata aacggagact 87060

ctcagtggcg ccttagccct cggccgccgt gtcctctcga aaacatcctc tcgaacgcaa 87120

ctgcccatgg aacagctatg caacgatgtg gtcgcgatga ggttacacag atcgcagccc 87180

accgtaacct caggaagcaa ttgaaagagt ctgtatcttt ccccagagac gcccccggcc 87240

ttgctcggcg gtgtcagctc cctcaaatac ctgtgagatc tcactgcata taacatccta 87300

attttcttaa atttctgggc atgtgaaccg gtttgacaca cgccacgcgc tgagcaaaaa 87360

tcgtcaaata tcttgttata atcttcatcg acgcgactgt cggtgcaccg gttccacggg 87420

acacgtataa tggtcttgtc ctcgtcatcc cattccactc cggggtatcg cttagactcc 87480
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acggcctcga tgaaccaagc ttttaaatcc atatcgcgtt ccgccatctc gcgtattcaa 87540 

tacctctgac cgagcggctc agttaaaata ccgagccccc agaaccccag gcctccgtaa 87600 

caaaaaggat ctagatctga aagaagccta actatatacg cccacgggaa aactctgtaa 87660

caaatgaaaa aaatgttctc agaaaacgaa gagaggaaaa ctgacgacac gcaatctagt 87720 

caaaccagaa cctaaaggcc gtcactgacc catatctttc cccttcccga gcgagtgcgg 87780 

tttctcacat gccgtgagca aacgaaaacg ctacacataa taagacacgt gtgagaggaa 87840

aactttattg cagggacagg gcaaaagcaa gctgtgcacg gtaacagtat gtgtcactgg 87900 

gccgcatccc cgcacgcccg ccacagacac tcgtgccaaa tttgaatgga cagggggatt 87960 

gtgggacacg gctcgccttc cggagtggga acgcctccta ggtggagaga cgcataggcg 88020 

gctggaaatc gcgttccgtg tggcggggtg gctgacaccg cactcagata ttgggcgagc 88080 

gaaaacacgc gcagaactcg cgcgttgtta attatatcca ccggctccag cgaacctccg 88140

ttacccagca ggaaggctgg aacgtgaccc atataccgaa tgtaaatccc agacccgtct 88200 

gcgtaaatca tcaacccgcg tatgatctcc ttccccagcc actttagcat gctcgtctgg 88260

tgaggaaaca tatcctgtgg ggtatgcggc aacagggcct gttccggccg aatgcccgca 88320

caaacgtgtc cttgcagtgg tcgctttggc gccggcaaca ggcgaacgcc gttgggcgac 88380

ccgacctcca tgttcccaac caaccgccca tagtaataaa caaccagccg tagcatccaa 88440 

cactgctccc cgtcgcctcc cacggcgcct tcgtactcca tcccggcacc gagataccgg 88500

aatctggcgg ttacctcgtt gcgtacgtcc aggagcatgc cgccggtggc atccgcttcg 88560

ttacagagcg ggcaagatcg tacaaagatc ggcaacaggc gaataacgcg gcacctgtta 88620 

ggcgccggcg ccgacaggtt ctcctcagtg gatacatctt cgacggtctg actgtggcgt 88680

atagcgctgg ccaaccagtt cttacattcc ctgacgctca atcgtctgtt accggtctgc 88740

cgtatgcctt taaattgaca atagtccaga tacatgtttc tctcatcgtc agagacgcca 88800

cccgcgccct ttaaatgatt ccaaggtaaa cgaaccagag tcttttcctc atccagccaa 88860 

tgcattccag gatgcttatc attatcgcaa caatcgagca accatgccct caggccagac 88920

ggccggttta cgcggatcga ccctgcccgt ccttccgcca tctctgaggg tccgctcgca 88980

taaactggct taaaaatcta tccccccccg tgcacctgta ctttacagtc acaccttcgg 89040 

gcacacgccg tagggctggc agggacggca cccagcagct attttcatct ctagcgccgc 89100

ctactagcgg aaatctgcgg tctacaaaat ccaccgagcc gcccgaatgt caccatgagt 89160

caattcatcc tgacagatca ccaaagtgac agggacctgg ggtcttttac tggagccagg 89220

gcgaccccca aggtaaagag tgactgcgtt gctgggaaca gtgtcgccgg gaacgggtga 89280

ccgggcgagt ctgcgcagta gacccataac atagaagatt tgagatttga cagaggatcg 89340 

cacagcccgt tgtgcgtggt tcccctctgc gtcgttccct gcgcagcgaa ccgtgtgcct 89400 

ggttctgtta tccacataca tccccagctc attgccatgc aaaaagacgc cctcctcaca 89460

cccgtttata atctcacaca cggtttctct ctgctccggc gttagcggcc catccaactg 89520

aggttttgga agccacatct tatacgctgt cccatagaaa cactcgtgac ccggtttggg 89580

gtcttggcgc ttatgcaatc gtacccctct acggtcctgg gacgtgtgtt ccgctaccag 89640

cgacccaaaa taaaaaactc gtatacgtag ccgccaaaat ggagtgatct ggtctggatg 89700

cggtggtgca accgtcgcca acctggcggc cctggcatat acggaccgcg ccgcaccgtc 89760

agcctccgcc ggagcgggag gaggcagggg cgtccctggc tcaagcccat ggacgcaggc 89820

cgcctctaag taatttagga cggtctccga atgactatct aagatgcagg cataacacag 89880

cggatccacg gcgggtctca agtaaaaaaa tctttctccg ccaaaacgcg ttgttgttgc 89940

ccgggcatcg tcctgttcaa acccagcgtt ttcccgaagc acccttagca gtcttttccg 90000

gcactccgac acgttgtgct ggcgtccaca ttccagatcc ctatccaggc agtacgcaac 90060

cacaggttgc atgccactca tacctcggtg ccatgatatc ctaaagctcc tcttttcctc 90120

gtcgcaccaa cacaagtgtt cgggaaacgt gttcgcttcc aaattggaaa taatccatct 90180

tcgcaggtga ttgtgagtga tttctatctc ggtcattgtg aacttggtcc acgaaaacta 90240

actcaaaagc cctcaccact cccctaaact gtagctgaat tttatgtcca ccttataggg 90300

tgcacgtgta cgtatatgcc ccgtgaaaac ctcgcacttc ctcataaatt atccccaaag 90360

cgtttcgggt cgctcaccta aactaactcg agcattacac ggttcatagt gtcccgggtt 90420

tcgccactcc cccaccgcga cagagaaacg aaacgttcaa tttaaagggc accaagctca 90480

actttattta aggagaaaaa aaccaccgcc atcagaaggt ttccgcaggc acacaccccc 90540

aagactattc ttggcgagtt cgcggcacgc tgtagttttt ttctgattat ttgtaattcg 90600

catccaatcg ccccaaacat agtcaataaa aaaaccgtaa acacacacac tctggtgaac 90660

aaattatgcg ttcctggccc caagacgtgc gtgagcgtca gcagcacagc ggctgctaaa 90720

acataaacca gggctaaaac gtttgaccga aacacgagtc cgaggcaaca aaaaagccca 90780

acgtggaaga caaaaaaata gtgtacaagt ccgaggagca gaggcgtcgt caggccaaca 90840

gacagcgtac ccgcggccgc ccctccaatc aaaacagtca cgagataaaa gtcccatcca 90900

cacacctcaa gcgcttgctt tatagagtat gtgatgtacc gcctagaggt gagcgaaaaa 90960

attgcatggc gtcgctgcag gcccgcctca taaagcggat gtgaccgcag cgctctgagt 91020

gaacaaacgc cacctactat aaaggccacc aggggcagca aaaacaccgg ggctaaaaaa 91080

aggttcttaa gaactaaaaa ataatacacc gagaacgctg ctaaaaatcc aagttcgtag 91140

tatctgtggg caatcgtggg gcacaaatat accacgtcat tcgccgcaac aacacagagc 91200

agcaggcaca ggcacgaccc cacaaaaatg gaaacgtgaa cccatggaag gtcaacgccg 91260

agggtggatg ccatagaacc ggcggtaata agaaacgccg agatggtgca cggtacatcc 91320

acctgccgcg agagccaggc cggtagccca aatatacaga atagcaaaag cagcccatgt 91380

aagccaaccg cgccccagga gaatataatg ttctcaacgc taatcagggg ctctacaaaa 91440

tacagcgagt gataaataaa acacagaata aacggggaag ccgagaccgt cccgctcata 91500

aaagatagcc aggcgaggga cgcctccgac gtgtacgtgc ccatggcgcg gcgtttaaaa 91560

caacgggttg aacgtcgcgc gaggcgtctt ggtttttcct ttcccgcgac tgggtgagct 91620

gctgtgtttt cgcttgccgt gttttttttc cggtcgagag ccgctcgtcc gcagtgattt 91680

ggcgcccccg tctgtcggct cggcaccgtc ctcggcaatt gggacgcact gctcttttaa 91740

gtccaggtca ggcgttctcg gaggggacgg gagctcagga accccctcgg gatcggcacc 91800 

ttccgagaga ttgcccggga cggcattatt ccggtccgtc ggatctacct ttaatagcat 91860 

gacgcccaac ctgacgtcag caaggctatc tgtcagcggc aggccctcaa cttccagaat 91920 

ctcattggcg tgccagcgga aaccaggcaa aaaaaatccc ggaatcttac atgcgttcaa 91980

tgcggccacc agagactcta aactcacatg agtcgtcagg tcgctagaaa ccacacccac 92040 

gtccagcttt cttcctctag cgcaagtcca cttggcgtcc ggggtgtcgg gtttaaaatc 92100 

aataatttta ctaaccccgt cgaccgtaaa cgtggcaact gccagcgttt cgcttaacga 92160 

gacggtgacc acctgcttgg ccctggtttt ctgctgcctt agccacttta acaacaaagt 92220 

cagcgtctta accgtaagga acaggcgatt ccccgtcccg gcctgctcta gcctgtcgtg 92280 

gaacgagggt aggttgggtt cgtccacgac gctcttgtgc acggtgctcg tgtgattgtc 92340 

cccgtaggcg atggtcgtcc taacaaactg cggctttcct tccgcgtccg actcgtcccc 92400 

gtggtggtag aacgccacgt tcatttccgc aatgttaact ccaaacatct cacgggtgtg 92460 

tgtgaacgtg ttcccaaggg aggtgtttcg aaagctaaag ctaacctcgt ccgtcatgtc 92520 

agaatatacc atcagcggcg ccagtgcatt tttaagcctg agaccgagaa cgcccgcgtc 92580 

ccccacgctg gagagcacgc taaggacggg ggccctgccc tggccgctga tctgtataac 92640 

gcctttctta acgatgccct taatgtggtc gtaaaccctg ctaatgctgc caagcgcatc 92700 

gacgtccacc ctggcgccgt aatgaaacga gacaggcatg gctaaggcga aattaacacc 92760 

ctaaaagacg agaagagaat ggcaaacacg acagacacag cgttcaatgt tcaacctttt 92820 

aatagagcct ctggcccgga aaatgagtaa catgggacac atggggacta cagatcatta 92880 

tgaacggatg tgacatattc agaattatca cgctcaaaaa agttcacact cttggcactc 92940 

gtatacgaca agggacacgc gtttgggggg cgggcaccga acaggggcgg tatgtttatt 93000 

gactttagaa tcctgtcggc cgtggcctgc aaaaactgct cgatgtctcc gacgttcacc 93060 

atggtcacgt ctcggctctt ggaacgaata aacagagtct ctatctccac ggcttcgcga 93120 

aagaggccgt ggatgtatgc gacggagggg gattcgtctc gggagatcat ggtgttgtac 93180 

agcaagctgg cggccctggt gtgtagcagc tcgtccctgg agatgtagtc gttagccagg 93240 

cacaccccgc gcatgattcc gcgaaggcga aaaagtccta tagagtaaaa tgaacttata 93300 

aagtatatcc cttcgataac caggaacagc agcaccttct cggcccgcgt ggtggccttg 93360 

cgcactcggg aatgcagcca gtcaatcttc ttcgccaggg tagcgtcctc gactattttt 93420 

ttggcataaa tcattaagtc cccccgatta ccgccaaaaa ataagttcag tatgttggca 93480 

tacactttgc cgtggatgtt ttccatggct acctgttcgg cgtaatagtg ggtcacgtcg 93540 

tgattgctga actctttaac taggtcttcg atgttaaagt ttaccagact ctctgccata 93600 

cccaaaaatg tgaagagaaa ctggtaaaac tccctat&gg tgggactcaa tagtcgcacc 93660 

tcatcggagt cctcgtgcag ggaaatttga ctggggaacc atctgttctg ccatgtctcc 93720 

tgggtgagtg ctaaaaagcc agggtggtcg ctcgcgtaca gaaaccgatc tggggactcc 93780 

atggcggcgg ttactattga caggccaaac ataccttggt ccgctcaccg tcggcctttt 93840

gtacgcagtt agactccgcc ccctctgaag agatgttgtc acccacgacc tgatctaagc 93900

actgcagggc attaagtttt gtctgtttct gaattcgaca ataatacagc atggttttta 93960

accctagcct gtatccggtc aggagcaagt ctcgaatata

aagcaactgc gttatctacc tcgcacaaag ttatttcaat agagagtccc ccgtgacgga 100140

cactaacgac ctcgacgacg ttttacggca aggggcgacg ttggacttta tactgagacg 100200 

gtccgggacg ctcggctata accaatatgc ccaactgcac cacataccca gctttatcaa 100260 

gaccaacgag tggaccgcgg ccatctttca atctcaggag tactttggac taattggact 100320 

tgacgcggcc atccgcgaac ctttcatcga atccctaaaa tcgatcctaa cgcgaaatta 100380 

cgccggcacc gttcaatact ttctgtttat atgcggcgac aaagccgggg ccgtaattat 100440 

caaaaacaaa acgttttacc tgttcgatcc ccattgcgta ccccacgtac caaatagtcc 100500 

ggcacacgta atcagttcgt cagaccccac cgccatacta gagtacgtgt caccgccaga 100560 

cagagaatac actggcagtt ttctatacat tatgcccagc gaatatgtca atccagagca 100620 

ctacatcacc aaccactata gaactataac gttcgccaaa gtacacggcc cgcacataga 100680 

tatatccacc ggcatagagc cgtgcaccat cgaagacatc ccaagcccgc cgcgatcgcc 100740 

ggatgtgacg tcaaaatcat ccaacctcgc acgcgtgccg agaaccacca ccgacacgtc 100800 

gagcgccaaa cccccaccgg cgacgctatc cggtttacgg ggcgcggaac caccgacaag 100860 

ctacccagac ccggcaacca acgacgcgga cacaaaactc ttaactcccg ctccagcgca 100920 

aaccgccgtc gaccaccccg aattccaaac aacacctgga gccacgctac ttctttcaga 100980 

actgtcggca tccaggggtc ggaaacgcaa gctttccagc cttcagcgat attcggattc 101040 

cgacgaagcg tcgtctgacg acgaaggggc cccacgtagg cgcgtacacg acgatgcgat 101100 

atccgccgag gtgatctgga tggacgacga tatatctcct ctatattcac cgtcggcgac 101160 

tccatcgttt gacgacgtgt tcgacagccc cccaatgagc ccggagttta catacgaaga 101220 

cgcgacagag gacacggatg gcgcgtttct agaacagatc gctcgagacg cggaaacgcc 101280 

gttctctgcg ttcgacgacc ttataacgga ccacgacttt tcttcccttg ataaaaaaat 101340 

agaacagtta atcaagtacg aagcgccctc gcagcacctg ccaaacatct cggacaaaca 101400 

aaacgggcga gccgtccgag aagcggcggc cctccaggcg atggacaaaa ttatgattaa 101460 

tatcatactt gaacacggtc taattacaga cgcgcaggcc cggggaccgt ccgcgtgcaa 101520 

aaacgttctt caatttttca tcctgtgggg agaaaaactc aacataccaa taagcgacgc 101580 

caagcaggtc ctagaactcg atctgcaact gataccgtta catacggcta tcagcgaagg 101640 

aaaattcaaa cagggggcgt tcaaaaaaca cctaacaact aaaatcaacc ggtgtctggc 101700 

gtcaatgagg gccacgcacg cagatgcgca aaaaaaactg gcgtcggctt tcaacgtcga 101760 

aggctcacag atttcgtcca gcgaagcgaa aatatcagtc cgggcgctga aggaacagat 101820 

cgccaaccac ctgagtccag gctttttagc ggtctactcc gcggacgagg taaaacatct 101880 

acgggataaa attcaggacc taaaaacagg catcgagcag cgcaacaaag aaatccaaca 101940 

ggaagaactg ttttttgatg ccatgctcac agccctggac acgttccaac cccctccgaa 102000 

aacggcattt ccaatggaga tctttccgca ccgtaaaacc gaagttatgc tcgaccacct 102060 

ggcgtccata accaccaggt taaccgagga cgccaccgaa gccctcaaca attacctgga 102120 

gaccccgccc gaccaaggaa cgcacattac caacattcca aacttttcat ccatcgtggc 102180 
aaatatcata tccacgttaa aaatcctaac gtacgcagaa aacgacatgc aattaaacgt 102240

aacgcccatg gcaacgtaca ggcgtcagct gttgtacctc ggaggcgagt tagcaaccat 102300 

ctttaattta gagtggccat acgaaaccgt gccaccggtt caagaactgc ccctcgtggc 102360 

gcgggcgaaa gcaaaaatgg aatcggtaac aaaaatggaa aagaaccaac aggctctcga 102420 

ccaaatactg ggagacgccg aaacgttact tgacacaata accgcaacat ccggagatga 102480 

gaacccggtc cgcgccatgt ccataccgat actggagacc tacattacaa acgcaggcgc 102540 

cctgataggc agttctcgaa accagcggtt cgaaaaactc aaggccgcca tccacgacct 102600 

ggcatcatcg gagtcgttca taataatgct gctaaacaac acgcggctcg ataacatatc 102660 

agacaatctg gccaagatcg acggcatcct gaccaacaac acacgttttc tttcaaacgc 102720 

cactgttagc aaaacgctcc aaacgctggg aggcagccta atacgcgaat gcgtagaagc 102780 

gctaaataaa aggagcccct cttccctcaa caacgcgcgt ctcctcgcgg ttcaaaccat 102840 

actggggcac gcgtccgttc cagatcacga gacgctgacg cgaatcgttt ccggcgtcgc 102900 

cagcgcacaa aaggaatccg ctggcgatga tccagatagg tggacgcgag taaccggtca 102960 

cctaaacgag ctgaagctcg taactaccca atcgcgtgtc gacaaagcca ccaggcgcaa 103020 

actgttaatg ataataaccc gtgacctcaa ggaggcggag gtgtctcagg aaacggtcct 103080 

ggaaacacgg tggcaagaaa acgtgctaaa gtttcaaccg tcgacgtcca aagaaatcga 103140 

agacttttta cagtcggcac cgtcagcaaa ggcccgaaaa ttcgcagaaa aacacctacg 103200 

gacgctgatc acccaattca acggccacga gcgaccgccg tccgaggcca ccgccgttcc 103260 

catggactac acgccgacgc ccatacccac gccacaggcc gtttctacgg ctaccgcgga 103320 

aaagggaaag gccgcatgga ataaaattca acaggccttt caggatttca actttcacct 103380 

catcgacgct tcggattggc aagagatggc atcagaatac tccagacacg gctcgtccct 103440 

tcctggtacg gttggaccaa agctggtgcg cttcatggag agcatctcaa acaccctgga 103500 

cgacatcctc acgcagaagc tggcatctct gcttccaaac gggcccgcgt tcagaccccc 103560 

agcgtttgac tggatcgcgc cctatcaaac acgcgtaaac gcgtttctaa aaaccatagg 103620 

cctgcccatg gtgcgcaacc tggcggacaa gatccatcac caatgccaaa ctgtcagtca 103680 

cgcggtgcaa tccgcagacc ttcaacaggc cacggtggga acaagtttag aacgacccgc 103740 

ggccgaatac tgtcgaatac tctctgacat gcaagtcgcg ttcaacgacc acggaatcgc 103800 

cgtaagatcg gaggccgcgg cgtacacgga cgcaatcaac tcgccggcca acgtcgtgac 103860 

tcccccgaaa cccaacctag aagcccccaa gaagctaata acggcaactg acgccctaac 103920 

cgtcgaggac tttccagatt tcctaaaaac gtcaatcctt caacaggagc agcgactcat 103980 

tgcgctccag agagcggaat ttcagcaact agaggccagc atctcggcgg ccgaacggct 104040 

ccgccaatcc acccgtgacg agatcgcagg caagatggca accgctatca cgcaactctt 104100 

accccgcgcc cccgtcgcaa tatcatcgag accgttgaac ttatcaaaac ctatagactt 104160 

tttgagttca acggtatacg acaaaatcct ggacaaggag ccttacgaga cagccatagc 104220 

gggattcgcg tggctggaaa tcgcgacaaa atccgtaatg gtctacagtc aacaaaacga 104280 
aacgcaacag ttaaacgtac tgctgagcga ggtagaaaaa cagagcaccg tcgcgcagcg 104340

tctacacgat ttggaactgt cggcgaaaaa cacggacgac gtaaaggtgc tgaagcaggc 104400 

gctagacgaa ctcgcgcccc tcagggtaaa gggcggaaaa accaccgtag acgcgtggaa 104460 

acaaaaactg gaaagcatag aatccctgct tcgcgccacg aggacggcag gcgaaatatc 104520 

atcggagctt gaacgcatcg gcacacaggc ggttggcacc atcaccgtcc gcgatttagg 104580 

aacgctctcc gatcaatgcc gggaagccgc aaatttcctc agacaggcca gtctacccga 104640 

aggcttctcg gacataggca caaaactcag cgagcttcag gcgtacatta agtacaaaaa 104700 

acagtttctg gagcattttg aaacaaccca gcctaacgtc tttcaacgct tcccgctatc 104760 

ccaaaacata accgaaaacg ttccggcgcg cccggcgatg gactcggtgg ccagactgac 104820 

caatcacctt cacgtgcgcg gcagcgcgcc ccactttaca acgtggatag aaacgctacc 104880 

gaccgtcgat ccggaaaaac caactcacgt cccggcgcac ggaggagccc ctctgcaccg 104940 

ccagatcacg tactcaaacg tcctagaggc gttgttttca ttatgttcca ccacgctaac 105000 

cccggttccg acggcccccg gtctggaaat cgcaaccagg gcacgccgcg gggcagaggc 105060 

cgcaacgtgg atggacagac agtggcccga catcgctcag acgctccaag acgttctcga 105120 

cacgtacgaa cacaccaccg cccacgcaaa ccgggacgcg gcatttaaca cattcttggc 105180 

gatgtgcgtt tttacgcaaa tcatcagggg cgctagcaga gccgtgacgc tcccgaagtt 105240 

accgagcacc gccgtcgatt ttccagaaga gatcgttcta acacccaggg aatgcacaac 105300 

actggtcacc gccatgtggc ccaccctggc ggccgcaatc ttacgattaa aatcctactc 105360 

ggaagcccta ggactaatga gtcgtttcct cccgctaatg ttccaggcgc tgccgcacct 105420 

aacgctagag gcccaggtca aaaacggccc acataacacc ccgcctcagt tgagatgctt 105480 

tgccaaaaca gaggcaattc cgtatttccc ggcgcaatgg cagtcagcga acctagagca 105540 

gagcctgtgg ggacagacgg actttttgca aatctgcgat aacaatcaac gcaaggccag 105600 

ggtggcggcc gtcacctggg cgctcacgac gatagacggc gtggttttgg accaactgtg 105660 

gtccacattt aaacccatga cagccgcgtc agacgacacg tacgtcgacc tagtcgagac 105720 

cctacacctg accacctttg gcccgcgcgg tccaacgccg aggcgagaaa cgaccaccga 105780 

gcacccgccg tacgagtacg gacagcccac gggctactgc atctcgggtc aatcgacgac 105840 

gccggtccag gcttcaaaca caccggtatc cgctttcgag gcggtgctcg gagcaatggt 105900 

gtttcacgta ccgatcagaa tatttttggc ggccacgccc aagcgccttg gccaggcgcg 105960 

cggcggcatg gggctcctca cacccatcct ggaatgcgtc cccgacgtcg agcccttcaa 106020 

aagcctgtat aacgcacccc gcaaacccgt gcccattgaa acgctacccg catccctcca 106080 

cccgcacgac gagcgacagg tctttctgag acaggcacag tggctatcct accgattcac 106140

accacacgaa gccgcccggt cgtcgactcc gccgcttctg gtggtcatag accctgaaaa 106200 

cctcgtaacg gcaacgtact ccagtggcgg gcctgcaaat ttcgagagca ggccgtttta 106260 

cgtgatgccc ggaccatacc ccccagactg gccaaaaacg ctgtcggtaa catcaaacac 106320 

gtccgtgacg cacctcagcc acgacgagat atgtaacctc tttactacgc tatcccgaga 106380 

acacgggacc gtgcaaggca gggatatctt cgcagcggct ccgacaaacg tcacaccgga 106440

acaaaccgcc aatcctccgg catgggaaac ggataaccga ttaataacgc aaacagaaac 106500

cgccaaaaaa cctcatataa ttcctgcgtc tcctaaagcg cggacagatc caccggtgga 106560

aaccacgacc caccattcac aagggcaagc gtcgcaacac gcaaacagca acgtaaacca 106620

gcccggtcaa attacttcac acgcgtcacg taacacaccg tcaaccgcac ctcaggcctc 106680

atcttcaccg gaaaaattca acacgcaaac ggtgcctcga ctaatatctc aaacgtcgga 106740

aacggcccat ataaaccagc cagcctccgg ccaggtcacc gaaccaaagg gaatctttgg 106800

gacgtataaa ccccgagtgc tcaccgaacc cgccaaaccc gcaaacgccg gcgtagcctc 106860

tcgccaacca gaggcaacca ccacggtccc caagttaccg attaatccac ccaccgctag 106920

ggtctttata gggaccgcgt ccaaactctc gccagccgtc gaagagagcc acggcgccac 106980

acccgacgca catcagtcga agatagatcg ggaaaaatac gccgagagtc ggcctcgccg 107040

caccccacac ctcgaagagg ggccacggga gcctcacgtc aacactccaa ccagcgcaca 107100

cataaacgtc ccctctagcc aaggtcaaaa aacagtacac gggcgcgaaa atcccggcct 107160

tcaaacagca actcccagcg ccccccaacc aaccgcatca aacccgcgca ttcaatacac 107220

gctccccaga acggacggcc ggttgcttca cgacgaatcg gaggtggaat cgaccccaac 107280

cgaggaggta aaacgatcgc caaaaacaca agatgtgtct cacgggcccg aaccggacga 107340

ctccaggtgg accgccccgc tcggtccaac catagagatt catcgactgg aacaccccca 107400

aattctcaaa aatataacat cactcaccgt ccccactccc agagtcaccc caatccctcc 107460

cactaacatc tggatacccc tatcccacgt caacatccaa cacgaagaaa tcacacgagc 107520

caagaatgtg ttaatgcgat ttattcaaaa cgtacgaaga aaacttcaag cgtcgtctga 107580

cgctctatcc gaggctattg ccagaataaa gtttttatat ctgtaacgcg cccatctcac 107640

ttgctttttt tattttgaga cgagcgtctt gtgtccagag tagttgcgtc gctaggtgat 107700

aacgaagtgg accctatgcc agaagacgcc acgctcccgg gtccgccgcc accgggcgcg 107760

gggcccatcg agcctatcaa tgaatggggc ccgctggaga tcgtagtaaa gctatttgac 107820

ccgagggtgg aggccaccgg ggcaacgctc ggggcggccg agccggaccc ggacaaaacg 107880

cccgagatac tagaactagc gtcctttttg ccgcgaaggc cccggaggtg gtctttgcgc 107940

agaattccat tcttcttttg catatacatg tcgtaatgat gtttggccgt taaaaacacc 108000

agataattac gtttcgcgat ggcatactgg gcgggagaca tgtcaccctg gggaaggttg 108060

ttcatctcgg caaccagcgg gtgatttggg taatcgtgct caaggcgtcc ctgaacgatt 108120

ggctccttaa cccgcaagga cgacatctct tttaacttct attacacttt ccacaggaca 108180

gggacgatat agacgaggtc aaataaaaca gctcggccac acgcaaatgc tttaataggc 108240

cggtcgcgca gtccggcgac gccaaacagg gcacgacgac gctaaccagg gagttcgcgt 108300

ctcgtatggc atgtgccgcg ttgttggcga gcacgcaccg taaataggga tctccaacac 108360

acgtgatctc gaatagagat ataacccgca tgtgcgatcc gccacaataa gagcaatata 108420

cgcgcccggt ggtagcacag atcgaaagct gcttctcttt ttggtcgcga ctgaaaaaca 108480

cgttggtggg tgggaaattt acggtttcaa atttaccccg tccgaaattc aaacagtaac 108540

cgcactcgag gcacaccacc accctcggag ctggcacggt cttctccagt acgctcctgg 108600 

ccaccacctg ggaccaaaca ggtagagaga tacacggaaa cagtacgtta tacgccaata 108660 

ctttttgacc caggtcgcgg gatatctccg tctcggtcga ctcccctatg ggcaacacaa 108720 

cacgggacat gctcagcagg gccctaaacg tcaggctcct cagaagggcg ttaaacgggt 108780 

tgccgcacgg gacggtcggc gccagttctc gcagcgaggc cagaagtccc gcgtccgaag 108840 

ggcccgggac actctcattc aggttagctc ccagacgtct ggaaatggac ggaacgttca 108900 

actgcatcga gacacaaccg cccccgttcc atttcttccg caaaacgggc agatccaacg 108960 

cgtgctgtgg caaacaggta accaggggaa accgctggcg acagttaagg gttttgcaca 109020 

cgagacaaca cgccctctga aacgacacaa caagatacct ggaccacgcg ctcctgggaa 109080 

cggccgttaa acttaaacct tcgtcgctgg gacagcccac gccggtgcag atacacctca 109140 

gcacccacgc gtacgcctct aacaacgacc ggccgatatc gtgcagccgc gatctcacgt 109200 

cgccgttttt aggtgggtta tccggcctcc agccggtagc aatctcgttc agggcggtct 109260 

gaaaggatgg ggcagaaatt aacgtgcaag cccattttgg gggtcgtccg tcccaggcac 109320 

cgagcccgta cgtcacaaaa cacacgtagc attcggggca tagtccgatt gagcgtatag 109380 

aggccgcgag atctaagccc agccgagaac catccagcca acgatgggca ggataagaac 109440 

cgcgcccatg acaggcagct tcgtcttcag ccaggttagg caagcggccc gaggccatcc 109500 

cccaattgta ccgattgaat tggttaattg gtcatcgccg cacgctctcc gcgcccaata 109560 

tccttcaact ccgaccccga agggcgggcc acggagcccg ttcgcctcaa cgcaccgagg 109620 

ccgtcggagt gacccgtcag ggcaaaaacc gttctaagaa gggtttttaa ccgtttagcg 109680 

ctctttggag tcacgaccaa aaactgtaaa acctgtcggt gctccgtaaa gtaggtgcgg 109740 

catatgacca tggagctgta aacgtttagg tctccggaga aaaccagacg tgccttaaat 109800 

ttcataaaat cgtcctggcc cagggacacg gacgagttcc tctcaagata cacgtccgaa 109860 

tttataggca ggttttttcc aaactgggca tcggcgtcac gtggcttaca caaaaaacat 109920 

ttcagcgtgg tggccaaacc gttgttgata attacaaaac acggggcaaa cgggtaggcc 109980 

agtctctcta gtttgtggag ccaaaactta tacacaaact cgagatgata gacgcagccg 110040 

tgctgcaggc gcacggtgca cacggggacc gccccgcctt tagcgtatac gggagccccg 110100 

tcctgacacc tctccaagtc cagggagatt ccagagggtc ccaggtaaga gacaactaaa 110160 

tcgcacagct cgtcaactaa acgttttccg gaactcatcg ttataaagat cctttaggtg 110220 

ctgtgcgtgg ctcccgtaaa aaccgcgtcc gtgctaacga ttttgtgaat gacctgtttt 110280 

acggcgttta ccttggcgtc caggaccatg cagtgctcac agtgagctga ccgcgtctga 110340 

gcacgatgac agaggaaagt ttttaaatac tgacagtagt taatggcgtt gagcctggaa 110400 

tatatggtgg gaaacataat tttcatgtca tcgggcagca gggactcgaa cgccaataaa 110460 

tcgtcaccga acatcacgtg agacagaggt aaaagatgct caccgccggt accgcgtaac 110520 

acgcgaccag ccaccccttc aaatatttta gccttaaaaa gcgggccccc taaagtcgtc 110580 
caactcagct taaaaactcc tacccattat gttcgtcccg tggcaactgg agaccctcat 110640

gagacactgg ccctctctgc ggggactcgt agaacaatcc ttcctccccg gtaccccgga 110700 

cggagctttt aacagcccgg tattaatcca cactcaggac tctctacaac ccgcctcatc 110760 

gtgcagggtg tgtagcctcc tgttcactct ggtccggaca ttcccacccc ccgactcttt 110820 

cttcgaagac tacggctggt tgtgcctcac ctgcctatac gccccccgat catggacggc 110880 

taccctcatg gtggctgccg accttttgga actaacgcac gtgtacttcc cgcaatgcgt 110940 

gaaagatggg ccagtataca ccgcccaaag catcctcgga atcgacgtcc agctgcactt 111000 

cttcgcaacc cgctgcttcc gacccatcga cagagaacaa atactccaca catctcattt 111060 

aaatttttta caaaccgagt ttattagggg catgttagaa ggcacgattc cgggatcgtt 111120 

ctgttttaaa acgtcctggc cgcgcacaga aaaggacgac caacaaccta ccgttgcgtg 111180 

ttgttccgtt ggccgcggaa gtcacaccaa ccgggataac cgcctacccg aggacctgga 111240 

agaggcgttc aactccacga acgccgagga aaagcccagc ctcctcggcg tcttttcggc 111300 

aacgtgggca gaatcccagc ttcttggctc cgacacacaa caggcagata cccatttaca 111360 

accctccgcc ttcccaaccc cagaagatgc tgaccaatca cagggcccct gcctgatgca 111420 

cccaacgctc aacctaaaaa caaaaaacca caccgcatcc atatgcgttc tatgcgagtg 111480 

tctggccgca cacccggacg ccggtccggt tctgaaagat ctgcgtcgcg acattctgga 111540 

aaacatggaa aacaacgtta agctcgtcaa tcgcatatcg tacatcctaa acgatccgga 111600 

ctcactgtca cacgtgcgcg acgaacatct gcgcggccta attaaacggt gctcggcaca 111660 

agaaatccac aagcattttt tttgcgaccc ggtgtgcgtc ctgaacacgt actcgcactg 111720 

tcccgcggtt ttatttaaat gcccacctcc cgaaaagtat aagaagctca aagctcgtct 111780 

ggcaaccgga gagttcctag actgcaacag aatatttgac tgcgagacct tacagaccct 111840 

ggccgtcctc tttaaggggt ctcaactggc caaaatcggc aaaaccacgt cgctcgagat 111900 

aatccgtgaa ctcggatttc aactgcgtcg acacaacatt caaatcaccc acccgtttca 111960 

aacctccaac ctatacattt aatcttcaga agcgcaccag acaatgccaa aacagcccag 112020 

aagtcgattg gcgtctcgag cgccgtacgc acctagcgtc aggcgaccgg acgggcccca 112080 

gtccacgcga ccggcatcca ggcacggcag ctgcaaaagc gaaatcatgc agtggaaaaa 112140 

gttagtttca gacacgcagt ttttttctgc cctaacgcgc cgccacgagc tgggggtgga 112200 

ctttttaaga gaaatgggga ccccgatatg cacctcaaag tccgttatgt tgccgttaaa 112260 

cctaaaaacc atcgccccgg gtcggtgcgt ctctctctca tcattcggac actcgtcaaa 112320 

catggggttc aactgttcgt cgtgcacgcc aactgacagg tcagcggtgt ctctggacgc 112380 

aaacgcgctc ggcgaagatt ccgccaggaa aaacagcgag ctgtgttcag tggcgttaac 112440 

cttttaccac cacgccgaaa aggtcgtgca gcacaagggc ttttacctgt ctctgctcag 112500 

ccactccatg gaagtcgtta ggaaaagctt cacgcaaccc gggttgctct acgcccacct 112560 

agtcctaaaa acctttggcc acgatccttt acctattttt acagtcgatg ccgatgagag 112620 

actcgcactc tgggcggtgt tccacactag agacctacac ctgggggaaa ccagtctgcg 112680 
actcattatg gacaaccttc caaattatga cataacggtg gactgcatca agcaaacgta 112740

cataatgaag tttacaccct cgcgaccgga caacgcaacc gtgacggttc ctgtcaacag 112800 

catttgcgag gccgtggcca ccctagactg caccgacgag tttcgagaag aaattcaaag 112860 

gggcacggcc atcataaact cccaggggct attgtaacct ttccctagac ggaaacagat 112920 

gtaattcacc tattcaaatg ttaagttttt gttttgaaca tatcaccaat aaaaacaact 112980 

tatgtttaca caaattgaag agcgtttcaa ttttaccata aacatagcaa aaaccacggt 113040 

aactaaaact caatagcgat attcggatgg gatccctaga gagcaatcga cgtgcgccgg 113100 

tattaattga cgggaacgca cgttaactgc tctccacgaa tcgcaaactc cgcgttttta 113160 

ggaccattcg acgccgttaa taattggcag taaactgcga aagggcgtac atcgcagtag 113220 

tgattttacg atgtgcacac ctttccgcgg ttcaccgcca acaacaaaag cgggtcacca 113280 

ggcacctaaa tcgcccgggc cttttatgcc aaagtacaaa agggaccggc gttctacgtt 113340 

cacgaacaac actgttgatc gcaaatacaa tagtgttgtt ggtgaccgca aactgccaaa 113400 

tcctaaaatg catggacata gtcctcaaaa tctagaacga acttccaaaa acatgcggca 113460 

ataaacaacc ccttgcagtt aatgcacatc gtaacgagac ggaatccaaa gaacacgccc 113520 

gatcttaaga ccacgggcac gtgtctttgg gttccggggg cgttacgatg gaagttcatc 113580 

accaagtatc ccatcaacct gttacaaaag cgtaaagata atcggtttac aataaacaat 113640 

aaaaaatagt ggacgaagtt acattccgac atacaagcgg acccaaaaaa acaccccaac 113700 

gcaaaacaac tggagactgt gtctttgggt cccttccacg tcgcaatgag atttctggtg 113760 

accgtgtgct gcttaacgcg ttacagttgt gttttatggc tacgtaaaca caccaatcta 113820 

gaatgctaac acgtgcaccg cgcttaggcg cgtttgtgca actaactgct gtcaggttac 113880 

tactcttatt tttaacaagt agcataccaa ctgcaattaa ttgcactaaa cccagtttag 113940 

tctttttagt cgagtcggcc catagtcttc atggagaacc atcattcatc tccgtcaacg 114000 

gcgtcggtgt taccgttgtt attgcgggca aaacaggctc gggggaaaaa agagcaacat 114060 

atcgtcagtt taaaacaact caagttctaa aatctcaccc actcgtaaac aaaaaacatc 114120 

gcagttaagg tgtatgtacc gaacataaca acacaagttt tttaaaacac agctgcggta 114180 

agtaaccccc attgccacgc gtgtcgcgtg ctaagtgttt ttaaaattac attgtgcgtt 114240 

ttacacaccg agcagtaatc tcaggagggc ggttaacgag cgatatacat attccctaaa 114300 

cacgggaacg cgcgctgacc gccctcccca aatcacaaca cgggactaca aagcctagtg 114360 

ttaatataat caaattaaaa aaccacagaa acctttagtc gtgcgcaaac actagcaaag 114420 

gtacctagag ctttccccta tacttcaaaa aacagcggtg ggttatttga ccacacgtta 114480 

agtaaacacc cgtaagaatt attcccgttt atcaaaatgg aaaaattaag gctttgcgtt 114540 

aaaatctgct aacgcaaagg gcacttaatt tttccagttt ggactcggaa ctttaccgct 114600 

aacgttaaaa tttaataatg caaaaggcac ataatttttt ttgtgtaacc tcaaaactct 114660 

acagctaaag ttaataacgg cgactttggg cagcgaatca gtgtctgtcg caaaccctgt 114720 

taaatttaaa cacaacaggg agcgcgacta aacacggact aactgctcat cgggccctgt 114780 
cgccgccctc ctcgcctcgg tgcacgctgc cgagcgtgcc acctgtttgg ttgtgtcatc 119040

ccatgtgcgc atccccccat cttatttccc cgaccaggca acataaacgt cacgtcgttt 119100 

gtaagaaata attgcttttt ttattttcgt taaccccgca accgcgatac agtctctcgc 119160 

ctttcacccg ttgttttgtt accagcccct cttggcggcc ggcaccggcg ttaaccgggt 119220 

gcgttggcgg ccacgacctt ggccacgtcg cgcctggaga cggcctgcag caggtcactt 119280 

aaaaccatgt agtttgaggg gctgaccagg gccgcctttt ccatttgcga gagccaccga 119340 

aaaaaggtgg gtgactggtt gcgcctgcga ccaagctggt cgcccgttag aaaaactaaa 119400 

tttttcacgt ccctttctga gagctggcgg tcaatggaga gcatcaggtg tttgtatgga 119460 

ctaaagtaac tggggctgga ccggagcctc tcgataataa atcccacgtc tagtaaaaaa 119520 

aaggtggaga ccaggtccaa tcgcccaacg agaaacacgg cctccagcaa caccgggagg 119580 

ggaataccgg gttctccagt tgaggggcac agcccgtttg caaacccctc gggcgtatcg 119640 

tcggacgccg gccggtcaaa aagccacagc acggcctctc tgtcgtctgc ctctaggtgg 119700 

cgaccgaagt ccactaaccg cttatgcggg aacatgggtc cggtgcggcg ctcgccgaca 119760 

ccctcttcta cctacactag agcgtttgat aaatcacgac gaaccgcggg cgttacttaa 119820 

aatgtggtct tcgtcaaatt cccgcaaggt ggtggcaacc gactcggcgg cggccgcgag 119880 

ggttgcggcc gaaacgccga cggcctccgc caatctggcc ggaggcggcg cagcggcacc 119940 

gaggctaaac agcgcgcagg cggccgccac cagggacggg ggcagaccgc cggtggccgg 120000 

gttcacgatg gctttgtgaa cagactccac cacctggctg tgaagggcaa agagctgctc 120060 

ttttgtaaag ccgcttttaa acagggtggg cccgatggcg tccgttggta agacggcctc 120120 

tagtttccac cttaaagcct taaggattga tttttcctgt ttcagtaaat ccgccacgga 120180 

aaaatcctcc gccgcgcaaa agcataaaaa ggcggccttg aaaggattca ggtctctaat 120240 

cttgccggct aaaaatagac aggcggcccc cagcctctga aaccgccgac gggggatgct 120300 

gcgacactct agatagcgat ccagaatact aacggccagt gggaaaacgc tagcatccgc 120360 

ctggtgggcc ctggcaacag accgcatcca cgtccccaag ataactcgca tgcccaccgt 120420 

cacctccgtt tgtatcgtcc catagcaggc cgtggatgtt acaaaggacg cttcgtgggc 120480 

caggaggtta gagagggccc ggtcctggta caggacgggg tcaatggtcc cggtgggcac 120540 

tgggccaaca gaagccattg tcaaggcaag gtaaagaaag cgatataagt cctacctgcg 120600 

gctcacggag gcttaatacg aacgtccggc ttaacacaca atcttctttg cgctctatgt 120660 

ccgctccaca ctggtcccat gtcttctgca gttcagccct gcctgttccc ggaaagcggc 120720 

ggactagatg caacgcggaa gcggctgttc atttaaggtg agtcacaggc ggggcaattt 120780 

gccaaccagt gacgcgtttc ctgcagtctg ggttttcacc tggcaagcgc cactggttct 120840 

tggcagccgg ccggtgggga tttttttagt gctgaattgg cagtcctctg tccatgcgca 120900 

ctatgctggc ctggaagtgg ggggcggatg ggtgatgtcg ttcccaccgg cgacgggcct 120960 

ttttgacggc cttggcggtt ccgcggcttt gacagaacac cgtaatgcaa aaagccggga 121020 

tgacgggact gccaggtgcg tccggcaaaa cgcggacgct ggatgctcta atgccgggaa 121080 
tcgggtcggc gttgcccgtg tggtagttca gggcgatgcg ctgctgctgg ttgagatggt 125340

actccatggc gtcgcgcggg tatctcacgc ccaggtaccg gccgttggcc caccctggga 125400 

ggacgaggcc ccgaagaacc ctaaacatga tgctgatggt ggtctggggg atgtggaggt 125460 

ttagccagag gcactcgtgg ttccctgatg cgttctcctc caggtggatg tcccactggt 125520 

cgggggtttt gggtccgggc gtgtcgtgag gggtctctct aagaagaccg agcgccccca 125580 

ggagctggaa cccaaactcc ccgcagcata gtgaaaatgt atccgctcgg cggaagaagg 125640 

ccataaggcc cccatagcac ccagggtcgt tgagaagacc catgatcgcg catcgggccc 125700 

ccacgtagct gtcttcgatg cccacggttc caccgatggt cagaccggag aatccccgga 125760 

ggatgttccc tcctctaagg tcgtcggtgg agacggccgc gacgtcgaac ccgacgttgg 125820 

tgaaggcggc catcagcgcc ctgggaagcg gggcaccggg ggtgaccaag gcggccactg 125880 

ctggcggcct cgacggcgtt gcaaacagag tcagttcgct gtttctgcaa acctcggcga 125940 

ggtggcccag gttgtgttgg ttgcacccgt agtcctttct gtagagttcc tgcgcgggcg 126000 

tgaagctggg cccccatgag taccactgtt cgtccgagaa cgaggtccag tttgccgcca 126060 

ccgaagtgag ggtctgtgaa tagacttcat cgttgttgtg tgagatgacg attctttctg 126120 

acagaccctc ctgacccacg gtgccgcaca cggtggcccg gcagtcaaag ttttggcacg 126180 

cctggcgcac ctcgtcgacg tgctggggct ggatttcgaa gatgacgccc ggggtttcgg 126240 

acaccagcca ctgcaggggc gtttcctctg acgggatgcg aatctgtagg cctcggttac 126300 

cggccagggc catctcgata gcggtaacca ccatgcctcc gtcgctgacg tcgtggccgg 126360 

acaccaccag accccgggag agcagggcct ctgttagcat gaagaggttg gccaggtgtg 126420 

tcgcctgaac gtccggtagg gtgggactcg gcaagagaca caggtgctca aaggtcgatc 126480 

cctgggtcag gtgcggcacg gggaaggaca ccagcacgat gaggttaccg gtggccttca 126540 

ggtcaggtgt gacgcgttgc ctggacgact tcacctcggc ggtggccgtg atgacgactg 126600 

cgttaaacgg cacgggggct accgtctgat gctgttgggt ggcgctgatc agttcttccg 126660 

acaggcacgg ggtgctgccg gccgacgtga tggcgaagtt gatgctcagg tccctgcata 126720 

gttccttgca ggcaaacagt gtgtgctgca ggagccaggc ctggtcgtcc tccgggttcc 126780 

aaccgaccga cgccgtcagc gtgatgtcgc ttagacgccg cacgtgcgct agcatgatgt 126840 

tggtgaccgc ctcgcagatg gcgtatctcg cgcccactgc cgcgttgatg cccatcttgt 126900 

atgcctgttc tccataagcc acgcactgtc cgacccaccg gttcctgccc ctgccctggg 126960 

tgacccaggt ttctgggcta accaggcgtt ccgtcgcgtc ctgcctggtg atcggctggg 127020 

ggcgatatat gggcacccgc gccattctgt ccgtaaacac cgacgtgtgg ttgacaatgt 127080 

ggtaatctga cagcggccgg cccaggggtc ccacttcgca ctgctgagcc acgagtccgt 127140 

tggagcatct gtccacgtgt cgcgtgacaa actctttgct gccaaccgtc ggacatcgca 127200 

gaagctgatc gaccacggag gccaggcgga ataggcccca gtcggtcggc gagggtgcct 127260 

gttcgccacc atcctcaggt tcttctgggc gcaccagcca gttggaggaa acgggctgct 127320 

cgtcgaaccc aaaagttgcg atgacttcgg ccgatgttgg ctgtctgatc ggccacgctc 127380 
ccgggttcca cagctccagg tcgttcacga actgaattcc aagctcttcg caggtttccc 127440 

caacgacggc gaaagggcat ccgcagatgc gtgcggccct cctgatggca tccaacgggc 127500 

ccggttccgg ctccgtgtcg cgcgggagga cggcaaagac cacgcaggca gacacgttga 127560 

gaaacttgtt aactaggatg tcgttcacgg tttcccgtga gatgtctggc gtggcggtca 127620 

gttggctcag aatggttggt ggaagctgag agacaaagag gcgcaggccg ccccgcgaga 127680 

ccagggctcc caggtggccg atgacagagg cgggttcatg ggcgcgatta atgctggtaa 127740 

ataggggccc ttccaaaatc tcgtaagcca gcgacaacgt ttgggtgatg cgggtaatac 127800 

ctgcggcgct ccccaggtaa agcgagcgct ggttgccatc gccgtcaacg ggttcaaaag 127860 

acccaagaca gatgataggc tgactgaata aagacctcct gaggaatgtg tatgcttgag 127920 

gcacgcacga taacagcgat gtggtggtaa taacgttcgc ctcgctggtg gtggcgctgt 127980 

ggacggtttt aagaaagcca ccgaggacag gcgtgccggt gcgcgccagc tgctggcagt 128040 

ggagtgtcgc tgttgtgtta atagctctgt ggcgtgcgcg agaaaatgcg cccactgacg 128100 

gcgtagtggc gtaaaatcca cacagggccg atctgatttc gcccccgggc agcagcgtgg 128160 

catggaacag aatgttttgc agcgcacctg cacctgtctg aatggcgctg tgcaggtgtg 128220 

taaatgagtc agctatgtgg gttacgtttt cgcacgggag gcgctggaaa acgcagctgt 128280 

agctcctgcg ggccatgacg gcgtatctgg acaccgtata gctgttgccg ttggaggcct 128340 

gcacggccag cggtaatatg ttccgttgta acgggagcat caccgcggcg catattgggt 128400 

ctctctcggg caacggtctc caggcggtgg attgcaatgt caccgggagc tggcgctgca 128460 

cgcgcacatc atattctact agtgtgccaa gtgtcagggc ggccaaggtg cccgggggta 128520 

cccgatcggc gacaaaccgt ctcgtctggc agctcagatg tctgcccacc tctaaccgca 128580 

acgttctatc gaggccgctg tctctaaggc actgcactat ttcgctagat cttgtggtag 128640 

gttggcgcat caggtctggg ccatagggca gggatatagt atgatggccc tggaggtgcg 128700 

ccgaccgagt ttccagaggt gcataattaa ccagaggcgc cagaagatgc cagataacgc 128760 

ccagcgctgg gtcgtcatcg tcgttgtcaa agttatcaga ggggcgtgtt acaagccaaa 128820 

gtaaaaattc actaaagcac tgttctggca caagcgccaa aacactttct gcgtccgaat 128880 

tgtcatgaat aaaagcctcc tcttctggac taagggctgc agcggcccag cgagggttcg 128940 

tgcgttgggc catatttgct aagtctagcc ggcgcaaaag caagcgggtg ggactttaat 129000 

agggcgtata ggggttgtta ggtatgcggt ggggagccgt ccatgttaca tggacttcaa 129060 

tattaattta tgttagtttt gggcaccttg ggatgttaca cttataactt cccccatgtc 129120 

acagtaatca cttgacagat ccccttctaa ctcataacaa ctgtttcggg gtgttgaggt 129180 

tacaggtaag ggtgggaggg gaatgttagg tggcgaacca aagcataacg tcgcggggcg 129240 

ggagtgggag cctgtaaccc cagagatgtc acttacgctt gttaggcgat cgacaactac 129300 

ctccgaaagt gtgtcctgtt ggatgttttg tggattaggt aaaacagaaa gttgtggaga 129360 

aagtaacctt atagatgacc tcctgctggc gcgtggctgg gtgacagctg gttgttgatg 129420 

aatgaacact tcctcgggcg ttcctggggt ggtgggtgtg gtctcactaa ggacggccaa 129480 
ctcaacacta acttcctccc gttcagaccg tgagcggttt cgtctgtagc gtcttgtgga 129540 

gggggttgcc tgaacccttg ttggtgtaaa caatccaggt aaccggtaaa tgcacgcggt 129600 

cagggcgatt tgtaacaaac ctaaaaacgc ataataagct tacaaacgca tttttggtaa 129660 

cagtttgtat aatttagccc caaagtttat acaaagcgaa actacttacc gattagtaaa 129720 

atgcaaaccg caagaaggat aacactcact gattctgttc caattggttc gaatttaaga 129780 

atggagagta aatttgcccc gctgagaatg cctggaaata agaggcgcgt attttacatg 129840 

tggtacaggg tagaaaacct tgtaggtttc agttgagtca ccctgccacg taaacccgcc 129900 

tagagtgctg cacgcggtgt tggttgcagg aagtacatcc taattttaac tctaaaaggg 129960 

gttccttgta aaattattta agttatggtg tacttacata atatccaatg agtagttaca 130020 

ctccaaaaag atactcgata aaatgcgtga tatgtgaagt ttgcaacaaa tgcagcggtg 130080 

agtaggccca agaaaatgat aaaccctata tgaggaaatt ttattagtac gtttgtgtaa 130140 

ctgcaaaata aaattttaaa taaatacttt atattcatgt taccaagttt aatttgtggt 130200 

ggttgttctt gaattatatg tactcctagt tctgaggcaa ctgtatgtag tgcaagcacg 130260 

ctgccttgca acataaccaa taggcatgtt ccatgattcg atattcgcca tgccatttta 130320 

agtgccacgc agattgcagc aatagctata aaaaaacaaa agtatgttaa tagtctataa 130380 

agtataaaac attaaaacaa aattaaaaaa ttaaatacct gtatttgccg taactataga 130440 

gtatgataat agtaaggtgt aagactgtgt tgttgtagtt tctgttaatg cataaagtcc 130500 

cacaagtgct aaaataaaaa aataaatgtt agttatttaa tttttaaaag tttatactat 130560 

tcaagtaatt caaagttgtg tcatacctgc gtaaattgca aaccaagatc caattattct 130620 

tagtatgttt gtttttccac atagccacac ataaccataa aacacaatgg gcattaaaat 130680 

tactggcaac agcgcagata ctttgtagtt tggagtccac ataagaaatg ccgtaagaat 130740 

aacagtgcct atccatgttg gtcccacaag aagacatgat gttgttttac tagaaacaat 130800 

tttcggtggt tttataacaa atattaatat gcctgaaaaa tataaacaaa cgattaattt 130860 

aaggccaata tcatagtaaa acaaacttta aacaggtaaa aagttacgtc ttacctatgt 130920 

aaaattgaga tacgcatgca atagcactac acgtctgtac caaaatactt gcgttgtcca 130980 

aggccaatgc cgatagtaaa acacaacatg ctgcaaacaa gtacacaaaa tataagttta 131040 

atttgctaat taaaattaat atttcaaatg tattataaaa tcatacctgt aatccatggg 131100 

aaaaaaattg aaggcaacgg agttaattga cctgctctgc ttcgattaca atacaaacta 131160 

atagttttcc ataaaagaga acaacacgta acaaatgaac aaataattgt gactgcttga 131220 

ttagaagtgt ccctggcatg ttgtaccaag gctattatcc aaaacgttaa tgtaagaagg 131280 

tttgtaagta accaactact ccaaataaaa aaggtatagc tacacgccat tttaaaataa 131340 

tcaaaactgt attatcaagt taagggggag tttgagcttt aaaaggttgg ttacatcgtt 131400 

tccaggcaaa cctcacactg agtcacacaa aggtttctgt gtgcccctgt acgggaatct 131460 

gccttattgc tgattttgtt ggcaaccaac caaagaaatc atggaaaagc agagtgggaa 131520 

tatttgtttt tgttcgttgg cctggatgct tggaaacatt cctgttattt tgatgttagg 131580 
ccaaactgtg gaaaacatta agagttgtta ttgaaaatat tttttggtaa caaaaccatg 131640 

ttttaggtcc caaaaggaac ataaaaatgt ttaatagaaa catgcctagt acagaacatt 131700 

ttgtttggta cgtaaactaa aaatatgtat ggtaagggct aggggtaagg gcaaggggta 131760 

agggctaggg gtaagggcaa ggggtaaggg caaggggtaa gggcaagggg taagggcaag 131820 

gggtaagggc aaggggtaag ggcaaggggt aagggcaagg ggtaagggca aggggtaagg 131880 

gctaggggta agggctaggg gtaagggcta ggggtaaggg ctagggtgag ctagggtgag 131940 

ctagggttat agttatagtg taaaataata ggcctacgat aaaatgtcaa catatatttt 132000 

ttaatttgtg tttcattaac agccatggtg tattagagta aaggttaacg cttcaataca 132060 

atatacaagt aaaagtaaac cacacgaatt tataacatat ttacaaaagc gaaccaagct 132120 

ggcacatgta acttcacgct accctaaacg tacaatactg ggactagaac ccagaggtag 132180 

ttagaatata cggtagttac agaactttgc agttccctta ggccagcagg gctctgcggt 132240 

taattaaaca aagtttaaag ataactgaaa ctttaggaag tgcgtatggg tgcaatgtgt 132300 

tccaaatagg gcaagggtta cataaactgt tgcctagcgg ccgggcccgg aggcgcccgg 132360 

ccggcgccgc cgccgggccg cggccgccat cttgcgcccg gggcgagggt cccccgcgcg 132420 

ccccccgggc ccgcgcgccg ggcgccgccg gcctcccccg gctcccggcc ctccgccccg 132480 

ccggcccggg cccgcggccg ccgcccccgg ccgccccgcg ccgcgcgccg ggccgctttc 132540 

ggttcgcggg gccgggggtc ccgcgggggg ccggggccgc ccccgggtgc cgctccgccg 132600 

ggcccggccg actcccggga gcgccccggt ccggccgagc ccggcggcgc cgcgccccga 132660 

cgccccccgg gcccggggcc ccacaagccg cggcgcgcgg ctcccgatgc cgggcggccg 132720 

ccgcccggca tggcggtcct ccgccggcct cccctccccc acgccgcccc gaaaggtggt 132780 

ctccgcgccg ccgggagggg ggccggggcc cggggcgcct cggcggggcc cggcgcgggg 132840 

cgcgaccgag ggccccggga gaacggggga tcgggaaaac gcgaggggag cgggggacag 132900 

gggacggcgt gtgcgtgctt gtgagacacc gggtacggct gcctgcctgc tcgctggcct 132960 

gcttgctgag gggacagtag gcctgcttgc tcgctggcct gcttgctgag gggacagtag 133020 

gcctgcttgc tgaggggaca gtaggcctgc ttgctcgctg gcctgcttgc tgaggggaca 133080 

gtagggctgc tggcttgcta gtagggctgc tcgctggcct gcttgcttgc tcgctggcct 133140 

gcttgcttgc tcgctggcct gcttgcttgc tcgctggcct gcttgcttgc tcgctggcct 133200 

gcttgcttgc tcgctggcct gcttgcttgc tcgctggcct gcttgcttgc tcgctggcct 133260 

gcttgctgag gggacagtag ggctgcttgc ttgctaaggg gacggtacgc ctgcctgatg 133320 

gcttgatagt agggctgctg ggctgctagt agggctgctg ggctgctagt agggctgctg 133380 

ggctgctagt agggctgctg ggctgctagt agggctcctg ggctgctagt agggctgctg 133440 

ggctgctagt agggctcctg ggctgctagt agggctgctg ggctgctagt agggctgctg 133500 

ggctgctagt agggctgctg ggctgctagt agggctgcct gctggcttgc ttgcttgctt 133560 

gctagtgggg ccgcttgcct gctactaggg ctgctgtgca gctgggagaa cagagtaggg 133620 

ctgccggcca gctgcgtgcg agggcgtccg agggccagac gaggacacgg gacccgggcc 133680 
tctcccccgc ccggaccgcc gggcacccgg cccagatct 133719 

<210> 2 
<211> 426 
<212> DNA 

<213> Cosmid 3 Fragment 7 T7 Sequence 
<400> 2 

gaattcatat tgcatatgtc ctctatggac acagtttgta cacacaactg aaccatttgt 60 
tccggtgtga gatattgcat catcacccac gtacagaaca cggcgtggta cgcccacgct 120 
acctgtcgca agtagttttg cgtgattcct ttttcttcct tgcggcctcg actttgaatg 180 
cgcctgtgcg cctccgaact gttgagcgtt agcaaaacga ccacatcacc gtcgtgtgcc 240 
gagaaaagag ataggatttg aaacaaatga tccggcgtta gcctgttgta ctttacgtga 300 
accagcggaa aaacaacggt ggcggacagt aggtgtctgt caaaaacaat ccaattcgct 360 
ggctttgtgc taccgtttcc cactagccac ggttgcaggt tgcgaccgat gccggcggcc 420 
gttgca 426 

<210> 3 
<211> 141 
<212> PRT 

<213> Deduced Amino Acid of Cosmid 3 Fragment 7 T7 Sequence 
<400> 3 

Ala Thr Ala Ala Gly Ile Gly Arg Asn Leu Gln Pro Trp Leu Val Gly 
1 5 10 15 

Asn Gly Ser Thr Lys Pro Ala Asn Trp Ile Val Phe Asp Arg His Leu 
20 25 30 

Leu Ser Ala Thr Val Val Phe Pro Leu Val His Val Lys Tyr Asn Arg 
35 40 45 

Leu Thr Pro Asp His Leu Phe Gln Ile Leu Ser Leu Phe Ser Ala His 
50 55 60 

Asp Gly Asp Val Val Val Leu Leu Thr Leu Asn Ser Ser Glu Ala His 
65 70 75 80 

Arg Arg Ile Gln Ser Arg Gly Arg Lys Glu Glu Lys Gly Ile Thr Gln 
85 90 95 

Asn Tyr Leu Arg Gln Val Ala Trp Ala Tyr His Ala Val Phe Cys Thr 
100 105 110 

Trp Val Met Met Gln Tyr Leu Thr Pro Glu Gln Met Val Gln Leu Cys 
115 120 125 

Val Gln Thr Val Ser Ile Glu Asp Ile Cys Asn Met Asn 
130 135 140 

<210> 4 
<211> 524 
<212> DNA 

<213> Cosmid 3 Fragment 7 SP6 Sequence 
<400> 4 

gaattcaatg actggcatgg tgccgcagga aaacgtctta agctgtccag agcccatgaa 60 
attttggacg tgtgtttatt caaattgcct taaagagcag cgcagcatag ttaagcaagg 120 
cacccacggg aaatcgatca cttctgctcg cgtatacgcg tgtcagagca agtttgcgct 180 

accttttcgt gcaacggccg ccggcatcgg tcgcaacctg caaccgtggc tagtgggaaa 240 

cggtagcaca aagccagcga attggattgt ttttgacaga cacctactgt ccgccaccgt 300 

tgtttttccg ctggttcacg taaagtacaa caggctaacg ccggatcatt tgtttcaaat 360 

cctatctctt ttctcggcac acgacggtga tgtggtcgtt ttgctaacgc tcaacagttc 420 

ggaggcgcac aggcgcattc aaagtcgagg ccgcaaggaa gaaaaaggaa tcacgcaaaa 480 

ctacttgcga caggtagcgt gggcgtacca cgccgtgttc ttgt 524 



<210> 5 
<211> 174 
<212> PRT 

<213> Deduced Amino Acid of Cosmid 3 Fragment 7 SP6 Sequence 
<400> 5 

Asn Ser Met Thr Gly Met Val Pro Gln Glu Asn Val Leu Ser Cys Pro 
1 5 10 15 

Glu Pro Met Lys Phe Trp Thr Cys Val Tyr Ser Asn Cys Leu Lys Glu 
20 25 30 

Gln Arg Ser Ile Val Lys Gln Gly Thr His Gly Lys Ser Ile Thr Ser 
35 40 45 

Ala Arg Val Tyr Ala Cys Gln Ser Lys Phe Ala Leu Pro Phe Arg Ala 
50 55 60 

Thr Ala Ala Gly Ile Gly Arg Asn Leu Gln Pro Trp Leu Val Gly Asn 
65 70 75 80 

Gly Ser Thr Lys Pro Ala Asn Trp Ile Val Phe Asp Arg His Leu Leu 
85 90 95 

Ser Ala Thr Val Val Phe Pro Leu Val His Val Lys Tyr Asn Arg Leu 
100 105 110 

Thr Pro Asp His Leu Phe Gln Ile Leu Ser Leu Phe Ser Ala His Asp 
115 120 125 

Gly Asp Val Val Val Leu Leu Thr Leu Asn Ser Ser Glu Ala His Arg 
130 135 140 

Arg Ile Gln Ser Arg Gly Arg Lys Glu Glu Lys Gly Ile Thr Gln Asn 
145 150 155 160 

Tyr Leu Arg Gln Val Ala Trp Ala Tyr His Ala Val Phe Leu 
165 170 

<210> 6 
<211> 630 
<212> DNA 

<213> Cosmid 3 Fragment 5 T7 Sequence 
<400> 6 

agcgagtcmg tgagcgagga agcggaagag cgcccaatac gcaaaccgcc tctccccgcg 60 
cgttggccga ttcattaatg caggttaacc tggsktrtcg aaattaatac gactcactat 120 
agggagaccg gcctcgagca gctgaagctt gcatgcctgc aggtcgactc tagaggatcc 180 
ccgggtaccg agctcgaatt ccagacaggt aagattttag ctcctaatgt ttgccttgcc 240 
gcctagctcc taatgtttgc cttgccgcct agctcctaat gtttgccttg ccgcctagct 300 
[...] atgkttgcct tgccgcctag ctcctaatgt 360 
[...] cttgccgcct agctcctaat gtttgccttg 420 
[...] 480 
[...] gttggttttt atgaccggct tggtacaaaa 540 
[...] aataaataaa aaagttaaaa cttatttctg 600 
ttgtctgtgg taattgcgtt ccccgagggt 630 

<210> 7 
<211> 406 
<212> DNA 

<213> Cosmid 3 Fragment 5 SP6 
<400> 7 

[...] gccaaggcga tcgacgccgg atttatccga 60 
[...] 120 
[...] cacggcggta accgcgacac ccaaaccatg 180 
[...] cccaagagca cctttctaac catggtgcag 240 
[...] caacgcgcgt acgtttccgc gccaccctcg 300 
[...] ataccgccac cagaactgac cgtcccgtcg 360 
[...] tttcagtcgg cgtttg 406 




<210> 8 
<211> 135 
<212> PRT 

<213> Deduced Amino Acid of Cosmid 3 Fragment 5 SP6 
<400> 8 

Glu Phe Ala Ala Pro Leu Glu Thr Leu Met Ala Lys Ala Ile Asp Ala 
1 5 10 15 

Gly Phe Ile Arg Asp Arg Thr Asp Leu Leu Lys Thr Asp Lys Gly Val 
20 25 30 

Ala Arg Val Ala Arg Ser Thr Tyr Leu Lys Ala Ser Gln Ser Pro Ser 
35 40 45 

Ser Gln His Gly Gly Asn Arg Asp Thr Gln Thr Met Ser Ala Leu Pro 
50 55 60 

Asp Asp Asn Ile Thr Ile Pro Lys Ser Thr Phe Leu Thr Met Val Gln 
65 70 75 80 

Ser Ser Leu Asp His Met Arg Asn Gln Gly Gln Arg Ala Tyr Val Ser 
85 90 95 

Ala Pro Pro Ser Met Pro Ala Thr Ala Ala Tyr Pro Ser Trp Ile Pro 
100 105 110 

Pro Pro Glu Leu Thr Val Pro Ser Tyr Ala Pro Pro Val Ala Pro Pro 
115 120 125 

Phe Pro Phe Gln Ser Ala Phe 
130 135 

<210> 9 
<211> 630 

<212> DNA 

<213> Cosmid 3 Fragment 3 T7 
<220> 

<221> misc_feature 

<222> ()..() 

<223> n = any nucleotide 



<400> 9 
grgcgcagcg agtcagtgag cgaggnaagc ggaagagcgc ccaatacgca aaccgcctct 60 
ccccgcgcgt tggccgattc attaatgcag gttaacctgg sttatcgaaa ttaatacgac 120 
tcactatagg gagaccggcc tcgagcagct gaagcttgca tgcctgcagg tcgactctag 180 
aggatccccg ggtaccgagc tcgaattcca gattgactca tcggtttcta accctaacaa 240 
agttgcatga acagagtatg ataccaatgg tggtagaaat gttagcagcg gttaaagaac 300 
acgtgacctt aatggaggtc tgtttgggcc tctttaaaga gctacgaaag cttcagattt 360 
taattgttga cgcgggagaa catttagatg atacgtgtgg cctttgggga aatatttatg 420 
ggcaggtaat gtcaaatgag gctattaaac cacgagccgt gaactggcca gcccttgaaa 480 
gctacattca aacgctaacc agcttggaaa gcaatgcagc caattaacca cggtttgtgt 540 
ttgttggtaa ttacattctc tgtaattaat ggatatgaat ataatgaaga aaatgtacct 600 
ggacttgaaa tagttctttt tacccctgcc 630 



<210> 10 

<211> 124 

<212> PRT 

<213> Deduced Amino Acid of Cosmid 3 Fragment 3 T7 Sequence 

<400> 10 

Ser Leu His Ala Cys Arg Ser Thr Leu Glu Asp Pro Arg Val Pro Ser 
1 5 10 15 

Ser Asn Ser Arg Leu Thr His Arg Phe Leu Thr Leu Thr Lys Leu His 
20 25 30 

Glu Gln Ser Met Ile Pro Met Val Val Glu Met Leu Ala Ala Val Lys 
35 40 45 

Glu His Val Thr Leu Met Glu Val Cys Leu Gly Leu Phe Lys Glu Leu 
50 55 60 

Arg Lys Leu Gln Ile Leu Ile Val Asp Ala Gly Glu His Leu Asp Asp 
65 70 75 80 

Thr Cys Gly Leu Trp Gly Asn Ile Tyr Gly Gln Val Met Ser Asn Glu 
85 90 95 

Ala Ile Lys Pro Arg Ala Val Asn Trp Pro Ala Leu Glu Ser Tyr Ile 
100 105 110 

Gln Thr Leu Thr Ser Leu Glu Ser Asn Ala Ala Asn 
115 120 

<210> 11 

<211> 524 

<212> DNA 

<213> Cosmid 3 Fragment 3 SP6 

<400> 11 

gaattcaata cgctccaaaa gatagagatc acaattttat ttttgacgcc aatcaaaacc 60 
cagatcgaca taaacaagtg caccacgacc accagaccga accgcttcca gacatgttcg 120 
atccagtaaa gcacctaagt ttacacaact tcaaaatctc agttttcaat accaatatgg 180 
taattaatac caaaatcacg tgccggtctc tcaccggtac cttcgagtca atcatcgaca 240 
tccccaggct cacaaataac tttgtaatga aaaaattctc cgtgaaagaa ccgtcattca 300 
ctgtgagcgt gttctactcc gacaacctgt gcaacggggc tgcgattaac gttaacataa 360 
gcggggacat gctgcacttt atgttcgcta tgggaaacct gaggtgcttt ttgccggtga 420 
agcacatttt cccggtttcg attgcgaact ggaactctac gttagacctc cacgggctcg 480 
aaaaccaata catagttaga cgggggcggc gagacgtttt ctgg 524 

<210> 12 

<211> 174 

<212> PRT 

<213> Deduced Amino Acid of Cosmid 3 Fragment 3 SP6 

<400> 12 

Ile Gln Tyr Ala Pro Lys Asp Arg Asp His Asn Phe Ile Phe Asp Ala 
1 5 10 15 

Asn Gln Asn Pro Asp Arg His Lys Gln Val His His Asp His Gln Thr 
20 25 30 

Glu Pro Leu Pro Asp Met Phe Asp Pro Val Lys His Leu Ser Leu His 
35 40 45 

Asn Phe Lys Ile Ser Val Phe Asn Thr Asn Met Val Ile Asn Thr Lys 
50 55 60 

Ile Thr Cys Arg Ser Leu Thr Gly Thr Phe Glu Ser Ile Ile Asp Ile 
65 70 75 80 

Pro Arg Leu Thr Asn Asn Phe Val Met Lys Lys Phe Ser Val Lys Glu 
85 90 95 

Pro Ser Phe Thr Val Ser Val Phe Tyr Ser Asp Asn Leu Cys Asn Gly 
100 105 110 

Ala Ala Ile Asn Val Asn Ile Ser Gly Asp Met Leu His Phe Met Phe 
115 120 125 

Ala Met Gly Asn Leu Arg Cys Phe Leu Pro Val Lys His Ile Phe Pro 
130 135 140 

Val Ser Ile Ala Asn Trp Asn Ser Thr Leu Asp Leu His Gly Leu Glu 
145 150 155 160 

Asn Gln Tyr Ile Val Arg Arg Gly Arg Arg Asp Val Phe Trp 
165 170 

<210> 13 
<211> 398 
<212> DNA 

<213> Cosmid 3 Fragment 1 T7 
<400> 13 

gatcccatag agtccctgtt ctgcggcggt ctttttaact ctatagacga taccattaac 60 
gcactgagcc gggattgttc ggtaacgttc tttcaacagg caaactatac caacgtcatg 120 
cgaaaacaaa acgagctgtt taccagactc aacagcatat tgtgtcaggg gagcgcggga 180 

tcgcaaaaac cggccacccc ctcggagcca cggaccgcca ccgtcgcggc gaccgcggca 240 

agcgacgtca ttaaagacgc acaatatcgc aaagaacagt acatgaaaaa ggtggccagg 300 

gacggcttta aaaaactaac agagtgtctg cagacgcaaa gcgcggtgtt ggcaaacgcg 360 

ctctgcatgc gcgtatgggg gggcgtcgca tacggcga 398 

<210> 14 
<211> 132 
<212> PRT 

<213> Deduced Amino Acid of Cosmid 3 Fragment 1 T7 
<400> 14 

Asp Pro Ile Glu Ser Leu Phe Cys Gly Gly Leu Phe Asn Ser Ile Asp 
1 5 10 15 

Asp Thr Ile Asn Ala Leu Ser Arg Asp Cys Ser Val Thr Phe Phe Gln 
20 25 30 

Gln Ala Asn Tyr Thr Asn Val Met Arg Lys Gln Asn Glu Leu Phe Thr 
35 40 45 

Arg Leu Asn Ser Ile Leu Cys Gln Gly Ser Ala Gly Ser Gln Lys Pro 
50 55 60 

Ala Thr Pro Ser Glu Pro Arg Thr Ala Thr Val Ala Ala Thr Ala Ala 
65 70 75 80 

Ser Asp Val Ile Lys Asp Ala Gln Tyr Arg Lys Glu Gln Tyr Met Lys 
85 90 95 

Lys Val Ala Arg Asp Gly Phe Lys Lys Leu Thr Glu Cys Leu Gln Thr 
100 105 110 

Gln Ser Ala Val Leu Ala Asn Ala Leu Cys Met Arg Val Trp Gly Gly 
115 120 125 

Val Ala Tyr Gly 
130 



<210> 15 
<211> 524 
<212> DNA 

<213> Cosmid 3 Fragment 1 SP6 








<400> 15 
gaattcacac ccgaacaaat ctaaaagcaa aattacacaa attaacaaaa atacaactat 60 
aacacagtta aaaaattcaa tacataatta caatggtaaa attcacgcgt acataattac 120 
aattttaaat tcaatttaaa aattcacgtg tacacaatta caaaattcac acaattttaa 180 
aatcctttta accataaaat tagaacggca ctgtatatat gcagcgtatt gcaaactggc 240 
agctttgtgt taaagaccac ggcattaaat tttaaattcg ccaggtcgcc aaaagtactg 300 
gttacgttag ttcatctaaa agcttaacgt gcttgtgctt aaatttacgc cgtgccattg 360 
ctccatttta aattaaaatt taaaatgtag tgcgagcgag ctagaaacag aaaccgcgac 420 
gcggggcagg ttgcgtaggt tcgccgtttt tggctccagt gacaatccaa aagctgcggt 480 
ttacgagcca ttgtttttgt caaccactaa accgaaagca tgcg 524 

<210> 16 
<211> 438 
<212> DNA 

<213> Cosmid 3 EcoRI Fragment 1 Kpn 1 Fragment 1 T7 
<400> 16 

ggtaccccca cgactataaa aagagacacc ctggtggccg ccgccgcgcc ctgccccgtg 60 
gtgcgagtaa gctccgccga cgacgcgccg cgggatctcg tcgcgtcacc agacaccggg 120 
gcgctctcca ttgacgcgtt cacaatcccg gtcggtctcc ccggggtggt ctcggcggag 180 
tgtcacgtgt ctatgcgcga caacggggtc cacgaacgca tgagccatta acggcgacaa 240 
tgggaacacc ggtgcgtttc tttcgcggcg agtggcaaac ctcgagtctg gtggacaacg 300 
gcacgccacg gtacagctcc ctggtgtggg ccgccactat ccacgacggc tacctgacac 360 
tggtgaacag gtcggagctg tgcgtcacgg agcggtctcc gtgtctgccg gcgtgcccca 420 
gcatcgggag actggtcg 438 



<210> 17 
<211> 76 
<212> PRT 

<213> Deduced Amino Acid Cosmid 3 EcoRI Fragment 1 Kpn 1 Fragment 1 T7; C-Terminus of Protein 

<400> 17 

Gly Thr Pro Thr Thr Ile Lys Arg Asp Thr Leu Val Ala Ala Ala Ala 
1 5 10 15 

Pro Cys Pro Val Val Arg Val Ser Ser Ala Asp Asp Ala Pro Arg Asp 
20 25 30 

Leu Val Ala Ser Pro Asp Thr Gly Ala Leu Ser Ile Asp Ala Phe Thr 
35 40 45 

Ile Pro Val Gly Leu Pro Gly Val Val Ser Ala Glu Cys His Val Ser 
50 55 60 

Met Arg Asp Asn Gly Val His Glu Arg Met Ser His 
65 70 75 

<210> 18 

<211> 497 

<212> DNA 

<213> Cosmid 3 EcoRI Fragment 1 Kpn 1 Fragment 2 SP6 
<220> 

<221> misc_feature 

<222> ()..() 

<223> n = any nucleotide 



<400> 18 



ggtaccgagc cgcttccaga cggacattat tccgagcgga accgtgctca agctcctggg 60 

gcgaaccgag aacggcacca gcgtgtgcgt gaacgtgttc cgtcaacagg tatatttcta 120 

cgcgaaggtc ccagccggcg tcaacgtcac ccacgtcctc cagcaggccc tcaagaacac 180 

cgccggcagg gccgcgtgcg gcttctcgac cagacgagta accaaaaaaa ttctcaaaac 240 

gtacgacgtc gcggagcatc ccgtcactga aatcacgcta tcgtccggtt ccatgctctc 300 

gaccctcagc gaccgcctcg tcgcgtgcgg gtgcgaggtg ttcgagtcaa acgtggacgc 360 

cgttcgccgg ttcgtcctgg atcacgggtt taccacgttc gggtggtact cgtgcgcgcg 420 

tgccacgccc cgcctagcgg ncagagatgc cagaacggcc ctggagtttg actgcagctg 480 

ggaggacctc agcgttc 497 



<210> 19 
<211> 165 
<212> PRT 

<213> Deduced Amino Acid of 
<220> 

<221> misc_feature 

<222> ()..() 

<223> X = any amino acids 

<400> 19 


Val Pro Ser Arg Phe Gln Thr Asp Ile Ile Pro Ser Gly Thr Val Leu 
1 5 10 15 

Lys Leu Leu Gly Arg Thr Glu Asn Gly Thr Ser Val Cys Val Asn Val 
20 25 30 

Phe Arg Gln Gln Val Tyr Phe Tyr Ala Lys Val Pro Ala Gly Val Asn 
35 40 45 

Val Thr His Val Leu Gln Gln Ala Leu Lys Asn Thr Ala Gly Arg Ala 
50 55 60 

Ala Cys Gly Phe Ser Thr Arg Arg Val Thr Lys Lys Ile Leu Lys Thr 
65 70 75 80 

Tyr Asp Val Ala Glu His Pro Val Thr Glu Ile Thr Leu Ser Ser Gly 
85 90 95 

Ser Met Leu Ser Thr Leu Ser Asp Arg Leu Val Ala Cys Gly Cys Glu 
100 105 110 

Val Phe Glu Ser Asn Val Asp Ala Val Arg Arg Phe Val Leu Asp His 
115 120 125 

Gly Phe Thr Thr Phe Gly Trp Tyr Ser Cys Ala Arg Ala Thr Pro Arg 
130 135 140 

Leu Ala Xaa Arg Asp Ala Arg Thr Ala Leu Glu Phe Asp Cys Ser Trp 
145 150 155 160 

Glu Asp Leu Ser Val 
165 

<210> 20 
<211> 314 
<212> DNA 

<213> Cosmid 3 EcoRI Fragment 1 Kpn 1 Fragment 2 T7 
<400> 20 

ggtaccgttg gggttgaata ggcgcaggtg tgttgtgcaa ccggaaagca aaagtgttgg 60 
ggtgacgatg aataggggct ccggtgccac gcccgcaact agaaacgcca gatcgccccg 120 
agcggcgctc cctgttggaa agattaactg aacccggact tccgcggagc ttccaggttg 180 
gatggttttt tcggcgtcgg cgtacacccc caatacgcgg ccgccggtgt gggtgggcag 240 
gacgggtccg gactggccca agactatctc ggcgttgggg tcccccggag cgtaaatgat 300 
tttcatttgc gccg 314 

<210> 21 
<211> 104 
<212> PRT 

<213> Deduced Amino Acid of Cosmid 3 EcoRI Fragment 1 Kpn 1 Fragment 2 T7 
<400> 21 

Ala Gln Met Lys Ile Ile Tyr Ala Pro Gly Asp Pro Asn Ala Glu Ile 
1 5 10 15 

Val Leu Gly Gln Ser Gly Pro Val Leu Pro Thr His Thr Gly Gly Arg 
20 25 30 

Val Leu Gly Val Tyr Ala Asp Ala Glu Lys Thr Ile Gln Pro Gly Ser 
35 40 45 

Ser Ala Glu Val Arg Val Gln Leu Ile Phe Pro Thr Gly Ser Ala Ala 
50 55 60 

Arg Gly Asp Leu Ala Phe Leu Val Ala Gly Val Ala Pro Glu Pro Leu 
65 70 75 80 

Phe Ile Val Thr Pro Thr Leu Leu Leu Ser Gly Cys Thr Thr His Leu 
85 90 95 

Arg Leu Phe Asn Pro Asn Gly Thr 
100 



<210> 22 
<211> 681 
<212> DNA 



<213> Cosmid 3 EcoRI Fragment 1 Kpn Fragment 3 SP6 
<400> 22 

ggtaccgcaa ggtggccacc tcggtaaccg tctatcgagg gtggaccgag accgccgtga 60 
ccggtaagca agaggtcatt cgaccggtgc cgcagtacga gattaaccac atggacacga 120 
cctaccagtg tttcagctcc atgcgcgtga acgttaacgg catcgaaaac acctacacgg 180 
acagggactt cactaaccag accgtgttcc tgcaaccggt cgaggggctc acggataaca 240 
ttcagcgata cttcagtcag ccggtgctgt acacgacacc gggatggttt cctggaatct 300 
acagggtccg aaccacggtc aactgcgaga tcgtggacat gatcgcgcgt tcggcggaac 360 
cgtactctta ctttgtcacc gccctgggag acacggtaga ggtatcgccg ttctgcttaa 420 
acgactcgac gtgctccgtc gctgataaag ccgaaaacgg cctcggcgtg cgcgtgctta 480 
caaattacac cattgttgac ttcgctaccc gcacgcccac caccgaaacg cgagttttcg 540 
cagactcggg agaatacacc gtatcgtgga aggcggaaga ccctaagtcg gcagtctgtg 600 
cgctgacgct ctggaaaacc tttcccaggg ccatacagac gacgcacgaa agccagctac 660 
cactttgtgg ccaacgacgt g 681 



<210> 23 
<211> 226 
<212> PRT 

<213> Deduced Amino Acid of Cosmid 3 EcoRI Fragment 1 Kpn Fragment 3 SP6 
<400> 23 

Tyr Arg Lys Val Ala Thr Ser Val Thr Val Tyr Arg Gly Trp Thr Glu 
1 5 10 15 

Thr Ala Val Thr Gly Lys Gln Glu Val Ile Arg Pro Val Pro Gln Tyr 
20 25 30 

Glu Ile Asn His Met Asp Thr Thr Tyr Gln Cys Phe Ser Ser Met Arg 
35 40 45 

Val Asn Val Asn Gly Ile Glu Asn Thr Tyr Thr Asp Arg Asp Phe Thr 
50 55 60 

Asn Gln Thr Val Phe Leu Gln Pro Val Glu Gly Leu Thr Asp Asn Ile 
65 70 75 80 

Gln Arg Tyr Phe Ser Gln Pro Val Leu Tyr Thr Thr Pro Gly Trp Phe 
85 90 95 

Pro Gly Ile Tyr Arg Val Arg Thr Thr Val Asn Cys Glu Ile Val Asp 
100 105 110 

Met Ile Ala Arg Ser Ala Glu Pro Tyr Ser Tyr Phe Val Thr Ala Leu 
115 120 125 

Gly Asp Thr Val Glu Val Ser Pro Phe Cys Leu Asn Asp Ser Thr Cys 
130 135 140 

Ser Val Ala Asp Lys Ala Glu Asn Gly Leu Gly Val Arg Val Leu Thr 
145 150 155 160 

Asn Tyr Thr Ile Val Asp Phe Ala Thr Arg Thr Pro Thr Thr Glu Thr 
165 170 175 

Arg Val Phe Ala Asp Ser Gly Glu Tyr Thr Val Ser Trp Lys Ala Glu 
180 185 190 

Asp Pro Lys Ser Ala Val Cys Ala Leu Thr Leu Trp Lys Thr Phe Pro 
195 200 205 

Arg Ala Ile Gln Thr Thr His Glu Ser Gln Leu Pro Leu Cys Gly Gln 
210 215 220 

Arg Arg 
225 

<210> 24 
<211> 389 
<212> DNA 

<213> Cosmid 3 EcoRI Fragment 1 Kpn 1 Fragment 3 T7 
<400> 24 

ggtacctcgg cgcacctgtc cgccgcgtac gtcgtttcaa cgacgtcgta cgcgtgaaac 60 
gtaatggatt ggtcttgggg gtccgggacg actggccgtt ggccgcgcgc cgtccacatg 120 
gaccgttcgt gggcgagcag cacgtctccg cgcgcgccgt tttcaaagta cgtcggcggg 180 
aacgggatcg tgaccgggat catcccgcct gcccccggcg ttcggaggca ggcgggaatg 240 
agcctgcaga cgtctggtgg cggctgaacg gcgccggcgg gggcgggcgc atcggtgcat 300 
ttgtgtgagg gggggcgtgg tcctcgagga cccaggtacg ggttaaaaaa atccatgctc 360 
gatacagaac gctgacgccc gggggcgcc 389 

<210> 25 
<211> 129 
<212> PRT 

<213> Deduced Amino Acid of Cosmid 3 EcoRI Fragment 1 Kpn 1 Fragment 3 T7 Sequence 



WO 01/88203 PCT/US01/16274 

Cloning of Radino virus Genome and Methods for its Use.ST25 

<400> 25 

Gly Ala Pro Gly Arg Gln Arg Ser Val Ser Ser Met Asp Phe Phe Asn 
1 5 10 15 

Pro Tyr Leu Gly Pro Arg Gly Pro Arg Pro Pro Ser His Lys Cys Thr 
20 25 30 

Asp Ala Pro Ala Pro Ala Gly Ala Val Gln Pro Pro Pro Asp Val Cys 
35 40 45 

Arg Leu Ile Pro Ala Cys Leu Arg Thr Pro Gly Ala Gly Gly Met Ile 
50 55 60 

Pro Val Thr Ile Pro Phe Pro Pro Thr Tyr Phe Glu Asn Gly Ala Arg 
65 70 75 80 

Gly Asp Val Leu Leu Ala His Glu Arg Ser Met Trp Thr Ala Arg Gly 
85 90 95 

Gln Arg Pro Val Val Pro Asp Pro Gln Asp Gln Ser Ile Thr Phe His 
100 105 110 

Ala Tyr Asp Val Val Glu Thr Thr Tyr Ala Ala Asp Arg Cys Ala Glu 
115 120 125 

Val 

<210> 26 
<211> 330 
<212> DNA 

<213> Cosmid 3 EcoRI Fragment 1 Kpn 1 Fragment 4 T7 
<220> 

<221> misc_feature 

<222> ()..() 

<223> n = any nucleotide 



<400> 26 
ggtaccgcct aaccttaaag atgtggggga cgatattttt tttaaacacc attagaatgc 60 
cttcttggtg cgttttgncc tcggtgccag gacacgtctt ttccaggtta aacctgaaga 120 
gttcgccggt ggccgaggcg ctgcacacgc gaaacttgaa cgcctcggcc ctgggtgggt 180 
tttcgggagg tgtgggcgtc gacgggcccg gcgtggactt tgcagtggtg gtcacgccct 240 
ttggggtggt gacgttttcg ccaaccggcg cgccaatcgc gattatcacg acccacgccc 300 
gcagcagacg acgcgtccgg ttaggtatca 330 



<210> 27 
<211> 109 
<212> PRT 

<213> Deduced Amino Acid [...] Sequence 
<220> 

<221> misc_feature 

<222> ()..() 

<223> X = any amino acid 

<400> 27 






Ile Pro Asn Arg Thr Arg Arg Leu Leu Arg Ala Trp Val Val Ile Ile 
1 5 10 15 

Ala Ile Gly Ala Pro Val Gly Glu Asn Val Thr Thr Pro Lys Gly Val 
20 25 30 

Thr Thr Thr Ala Lys Ser Thr Pro Gly Pro Ser Thr Pro Thr Pro Pro 
35 40 45 

Glu Asn Pro Pro Arg Ala Glu Ala Phe Lys Phe Arg Val Cys Ser Ala 
50 55 60 

Ser Ala Thr Gly Glu Leu Phe Arg Phe Asn Leu Glu Lys Thr Cys Pro 
65 70 75 80 

Gly Thr Glu Xaa Lys Thr His Gln Glu Gly Ile Leu Met Val Phe Lys 
85 90 95 

Lys Asn Ile Val Pro His Ile Phe Lys Val Arg Arg Tyr 
100 105 

<210> 28 
<211> 140 
<212> PRT 
<213> Orf 29b 

<400> 28 

Asn Val Ala Val Glu Gly Asn Ser Ser Gln Asp Ala Gly Val Ala Ile 
1 5 10 15 

Ala Thr Val Leu Asn Glu Ile Cys Ser Val Pro Leu Ser Phe Leu His 
20 25 30 

His Ala Asp Lys Asn Thr Leu Ile Arg Ser Pro Ile Tyr Met Leu Gly 
35 40 45 

Pro Glu Lys Ala Lys Ala Phe Glu Ser Phe Ile Tyr Ala Leu Asn Ser 
50 55 60 

Gly Thr Phe Ser Ala Ser Gln Thr Val Val Ser His Thr Ile Lys Leu 
65 70 75 80 

Ser Phe Asp Pro Val Ala Tyr Leu Ile Asp Gln Ile Lys Ala Ile Arg 
85 90 95 

Cys Ile Pro Leu Lys Asp Gly Gly His Thr Tyr Cys Ala Lys Gln Lys 
100 105 110 

Thr Met Ser Asp Asp Val Leu Val Ala Thr Val Met Ala His Tyr Met 
115 120 125 

Ala Thr Asn Asp Lys Phe Val Phe Lys Ser Leu Glu 
130 135 140 

Box I Observations where certain claims were found unsearchable (Continuation of item 1 of first sheet) 

This International Search Report has not been established in respect of certain claims under Article 17(2)(a) for the following reasons: 

1. | | Claims Nos.: 
because they relate to subject matter not required to be searched by this Authority, namely: 

2. | | Claims Nos.: 
because they relate to parts of the International Application that do not comply with the prescribed requirements to such an extent that no meaningful International Search can be carried out, specifically: 

3. | | Claims Nos.: 
because they are dependent claims and are not drafted in accordance with the second and third sentences of Rule 6.4(a). 



Box II Observations where unity of invention is lacking (Continuation of item 2 of first sheet) 



This International Searching Authority found multiple inventions in this international application, as follows: 



1. | | As all required additional search fees were timely paid by the applicant, this International Search Report covers all searchable claims. 

2. As all searchable claims could be searched without effort justifying an additional fee, this Authority did not invite payment of any additional fee. 

3. As only some of the required additional search fees were timely paid by the applicant, this International Search Report covers only those claims for which fees were paid, specifically claims Nos.: 

4. No required additional search fees were timely paid by the applicant. Consequently, this International Search Report is restricted to the invention first mentioned in the claims; it is covered by claims Nos.: 

1-3, 13, 14 (complete) 

Remark on Protest | | The additional search fees were accompanied by the applicant's protest. 

| | No protest accompanied the payment of additional search fees. 

Form PCT/ISA/210 (continuation of first sheet (1)) (July 1998) 



INTERNATIONAL SEARCH REPORT 



International Application No. PCT/US 99/26260 
1. Claims: 1-3, 13, 14 

An isolated virus (RRV) as deposited with ATCC as deposit 
accession number VR-2601, or having the nucleic acid sequence shown 
in SEQ ID NO 1 or having at least 80% sequence identity to 
the nucleic acid sequence shown in SEQ ID NO 1; non-human 
mammal infected with said virus; 



2. Claims: partially 4-12, 15, 16, 17 

A purified protein having an activity of an RRV protein and 
comprising an amino acid sequence SEQ ID NO 3; a nucleic 
acid molecule encoding said protein or comprising the 
sequence SEQ ID NO 2; a recombinant molecule comprising a 
promoter sequence linked to said nucleic acid molecule; a 
cell transformed with said recombinant nucleic acid 
molecule; an oligonucleotide comprising at least 20 
contiguous nucleotides of the sequence shown in SEQ ID NO 2; 
an isolated nucleic acid molecule that hybridizes under 
stringent conditions with a nucleic probe comprising said 
oligonucleotide sequence and encodes a protein having an RRV 
protein biological activity; 



3. Claims: partially 4-12, 15, 16, 17 

Inventions 3 to 88: Idem as subject 2 but limited to each 
of the designated proteins, where the amino acid sequences 
are respectively limited to SEQ ID NO 5 to 165 and the 
corresponding nucleotide sequences to SEQ ID NO 4 to 164. 
For the sake of conciseness, the first subject matter is 
explicitly defined; the other subject matters are defined by 
analogy thereto. 



4. Claims: partially claims 4, 15, 16, 17, 18 

Invention 89: An isolated nucleic acid molecule encoding at 
least one RRV protein, as far as not covered by one of the 
previous inventions; an oligonucleotide comprising at least 
20 contiguous nucleotides of said sequence or an isolated 
nucleic acid molecule that hybridizes under stringent 
conditions with a nucleic probe comprising said 
oligonucleotide sequence and encodes a protein having an RRV 
protein biological activity. 



5. Claims: 19-34 

Invention 90: A method for testing the efficacy of a drug in 
the treatment of a condition associated with infection by 
RRV, the method comprising administering the drug to a 
non-human primate infected with an RRV and observing the 
effects; a method for producing a non-human primate model for 
testing potential treatments for a condition associated with 
RRV infection; a method for testing the efficacy of a 
candidate vaccine against RRV infection or conditions 
associated with it. 
Applicant 

OREGON HEALTH SCIENCES UNIVERSITY et al. 

1. This international preliminary examination report has been prepared by this International Preliminary Examining Authority 
and is transmitted to the applicant according to Article 36. 

2. This REPORT consists of a total of 7 sheets, including this cover sheet. 

☒ This report is also accompanied by ANNEXES, i.e. sheets of the description, claims and/or drawings which have 
been amended and are the basis for this report and/or sheets containing rectifications made before this Authority 
(see Rule 70.16 and Section 607 of the Administrative Instructions under the PCT). 

These annexes consist of a total of^sheets. 



3. This report contains indications relating to the following items: 

I ☒ Basis of the report 
II □ Priority 
III ☒ Non-establishment of opinion with regard to novelty, inventive step and industrial applicability 
IV □ Lack of unity of invention 
V ☒ Reasoned statement under Article 35(2) with regard to novelty, inventive step or industrial applicability; 
citations and explanations supporting such statement 
VI □ Certain documents cited 
VII □ Certain defects in the international application 
VIII ☒ Certain observations on the international application 



CORRECTED 
VERSION 



Date of submission of the demand 
03/06/2000 



Date of completion of this report 



20.02.2001 



Name and mailing address of the international 
preliminary examining authority: 
European Patent Office 

D-80298 Munich 
Tel. +49 89 2399 - 0 Tx: 523656 epmu d 

Fax: +49 89 2399 - 4465 



Authorized officer 

Surdej, P 

Telephone No. +49 89 2399 7334 



Form PCT/IPEA/409 (cover sheet) (January 1994) 



INTERNATIONAL PRELIMINARY 
EXAMINATION REPORT 



International application No. PCT/US99/26260 



I. Basis of the report 

1. This report has been drawn on the basis of (substitute sheets which have been furnished to the receiving Office in 
response to an invitation under Article 14 are referred to in this report as "originally filed" and are not annexed to 
the report since they do not contain amendments (Rules 70.16 and 70.17).): 

Description, pages: 
1-64 as originally filed 

Claims, No.: 
1-34 as received on 03/06/2000 with letter of 02/06/2000 

Drawings, sheets: 
1-13 as originally filed 

Sequence listing part of the description, pages: 
1-59, as originally filed 



2. With regard to the language, all the elements marked above were available or furnished to this Authority in the 
language in which the international application was filed, unless otherwise indicated under this item. 

These elements were available or furnished to this Authority in the following language: , which is: 

□ the language of a translation furnished for the purposes of the international search (under Rule 23.1 (b)). 

□ the language of publication of the international application (under Rule 48.3(b)). 

□ the language of a translation furnished for the purposes of international preliminary examination (under Rule 
55.2 and/or 55.3). 

3. With regard to any nucleotide and/or amino acid sequence disclosed in the international application, the 
international preliminary examination was carried out on the basis of the sequence listing: 

contained in the international application in written form. 

□ filed together with the international application in computer readable form. 

□ furnished subsequently to this Authority in written form. 

☒ furnished subsequently to this Authority in computer readable form. 

☒ The statement that the subsequently furnished written sequence listing does not go beyond the disclosure in 
the international application as filed has been furnished. 

☒ The statement that the information recorded in computer readable form is identical to the written sequence 
listing has been furnished. 

4. The amendments have resulted in the cancellation of: 

□ the description, pages: 
□ the claims, Nos.: 
□ the drawings, sheets: 



5. □ This report has been established as if (some of) the amendments had not been made, since they have been 
considered to go beyond the disclosure as filed (Rule 70.2(c)): 

(Any replacement sheet containing such amendments must be referred to under item 1 and annexed to this 
report.) 

6. Additional observations, if necessary: 

III. Non-establishment of opinion with regard to novelty, inventive step and industrial applicability 

1. The questions whether the claimed invention appears to be novel, to involve an inventive step (to be 
non-obvious), or to be industrially applicable have not been examined in respect of: 

□ the entire international application. 
☒ claims Nos. 4-12, 15-34. 

because: 

□ the said international application, or the said claims Nos. relate to the following subject matter which does 
not require an international preliminary examination (specify): 

□ the description, claims or drawings (indicate particular elements below) or said claims Nos. are so unclear 
that no meaningful opinion could be formed (specify): 

□ the claims, or said claims Nos. are so inadequately supported by the description that no meaningful opinion 
could be formed. 

☒ no international search report has been established for the said claims Nos. 4-12, 15-34. 

2. A meaningful international preliminary examination report cannot be carried out due to the failure of the nucleotide 
and/or amino acid sequence listing to comply with the standard provided for in Annex C of the Administrative 
Instructions: 

□ the written form has not been furnished or does not comply with the standard. 

□ the computer readable form has not been furnished or does not comply with the standard. 

V. Reasoned statement under Article 35(2) with regard to novelty, inventive step or industrial applicability; 
citations and explanations supporting such statement 
1. Statement 

Novelty (N) 
Yes: Claims 1 
No: Claims 2-3, 13-14 

Inventive step (IS) 
Yes: Claims 
No: Claims 1-3, 13-14 

Industrial applicability (IA) 
Yes: Claims 1-3, 13-14 
No: Claims 

2. Citations and explanations 
see separate sheet 

VIII. Certain observations on the international application 

The following observations on the clarity of the claims, description, and drawings or on the question whether the 
claims are fully supported by the description, are made: 
see separate sheet 
Reference is made to the following documents: 

D1: DESROSIERS, R.C. ET AL: 'A herpesvirus of rhesus monkeys related to 
the human Kaposi's sarcoma-associated herpesvirus.' J. VIROL., vol. 71, 
no. 12, December 1997 (1997-12), pages 9764-9769 

Sequence comparisons referred to in this report are already known to the applicant. 
Introduction 

The application discloses isolated rhesus macaque rhadinovirus and its complete 
genomic sequence. 

Re Item V 

Reasoned statement under Article 35(2) PCT with regard to novelty, inventive step or 
industrial applicability; citations and explanations supporting such statement 

Novelty (Art. 33(1) and (2) PCT) 

1. Claims 2 and 3 are not novel. D1 discloses a strain of rhesus monkey 
rhadinovirus which is related to the Kaposi's sarcoma-associated herpesvirus, 
human herpesvirus 8. The virus described in D1 is very closely related to the virus 
described in the present application. Over the known portion of its genome, the 
virus disclosed in D1 has the same genome organisation as the virus described in 
the present application: the open reading frames (ORFs) are in the same order 
with the same assigned function and, e.g., a homolog of interleukin-6 is present 
at the same position (Fig. 2 and page 9766, right column, 2nd paragraph). More 
importantly, the amino acid sequence of the DNA polymerase disclosed in D1 is 
100% identical over a 1014 amino acid overlap to sequence SEQ ID No. 15 of the 
present application (the overlap being the entire length of the open reading 
frame; see also sequence comparisons). Such conservation of the amino acid 
sequence and genome organisation strongly suggests that the virus disclosed in 
D1 is a close variant of the virus disclosed in the application and falls within 
the scope of claims 2 and 3. Unless the applicant can convincingly show that the 
subject-matter of claims 2 and 3 is clearly different from the prior art, 
novelty is denied for said claims. 

2. Claims 13 and 14 are not new. D1 discloses rhesus monkeys infected with 
rhesus rhadinoviruses (page 9764, right column, last paragraph to page 9766, left 
column, first paragraph) which fall within the scope of claim 2 (see point 1). It should 
also be noted that the non-human mammals referred to in claim 13 may eliminate 
the virus; thus, any non-human mammal falls within the scope of the claim. 

Inventive step (Art. 33 (1) and (3) PCT) 

3. The subject-matter of claim 1 is novel but it is not inventive. D1 discloses a 
rhesus rhadinovirus which has the same genome organisation over the known 
portion of its genome as the virus disclosed in the present application (Fig. 2). The 
DNA polymerase open reading frame disclosed in D1 is 100% identical over a 1014 
amino acid overlap to the corresponding ORF of the present application 
(SEQ ID No. 15; see sequence comparisons). In the light of the prior art, the 
technical problem of the application is to provide an alternative way to isolate 
a rhesus rhadinovirus genome, and the solution is the provision of the rhesus 
rhadinovirus having the nucleic acid sequence SEQ ID No. 1. D1 addresses the 
same technical problem as the present application. The same solution as the 
present application is disclosed in D1 with the isolation of e.g. one rhesus 
rhadinovirus isolate (H26-95; page 9766, left column, 1st paragraph). The mere 
disclosure of a variant of the rhesus rhadinovirus of D1 does not confer an 
inventive step on the subject matter of claim 1. Thus, no inventive step is 
acknowledged for claim 1. 

Re Item VIII 

Certain observations on the international application 

4. In claim 2, the sentence "a purified virus having a nucleic acid sequence shown in 
SEQ ID No. 1" is not clear since it cannot be seen which part of sequence SEQ ID 
No. 1 is present in the said purified virus. Hence, the scope of claim 2 is unclear 
(Art. 6 PCT). 

5. In claims 2 and 3, the length of the nucleic acid sequence having 80% and 95% 
identity to sequence SEQ ID No. 1, respectively, is not given. Thus, claims 2 and 3 
are not clear (Art. 6 PCT). 
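
The point about the comparison length can be illustrated numerically: a percent identity figure is only meaningful together with the span over which it is computed. The following is a minimal Python sketch, not the method used for the sequence comparisons in this report. The function `percent_identity` and the two toy sequences are hypothetical, and the sequences are assumed to be already aligned without gaps.

```python
def percent_identity(a: str, b: str) -> float:
    """Percent of identical positions between two pre-aligned, equal-length sequences."""
    if len(a) != len(b):
        raise ValueError("sequences must be aligned to the same length")
    if not a:
        raise ValueError("sequences must be non-empty")
    matches = sum(1 for x, y in zip(a, b) if x == y)
    return 100.0 * matches / len(a)


# Hypothetical toy sequences, not taken from SEQ ID NO 1.
reference = "ATGCATGCATGCATGCATGC"  # 20 nucleotides
candidate = "ATGCATGCATTTTTGCATGC"  # differs from reference at 3 positions

full_length = percent_identity(reference, candidate)           # all 20 positions
first_window = percent_identity(reference[:10], candidate[:10])  # first 10 only

print(f"full length: {full_length:.1f}%")
print(f"first 10 nt: {first_window:.1f}%")
```

In this toy case the same pair of sequences meets a 95% threshold over the first ten nucleotides but not over the full length, which is why a threshold stated without a comparison length leaves the scope open.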

6. Expressions such as "incorporated by reference", for example on page 17, line 
11, seem to imply that other subject matter is incorporated by reference; 
however, the application should be self-explanatory (Art. 5 and 6, Rule 9.1(iv) and 
Preliminary Examination Guidelines Ch. II-4.17 PCT). 

7. Vague statements such as "spirit of the invention", for example on page 64, line 
36, imply that the subject matter of the invention may be different from what is 
defined by the claims, thereby resulting in lack of clarity (Art. 6 PCT) when used to 
interpret them (see also the PCT Preliminary Examination Guidelines Ch. III-4.3a). 
CLAIMS AS AMENDED UNDER ARTICLE 34 

1. An isolated virus (RRV) as deposited with ATCC as deposit accession 
number VR-2601. 

2. A purified virus, having a nucleic acid sequence 
(a) shown in SEQ ID NO 1 or 
(b) a conservative variant thereof. 

3. The purified virus of claim 2, wherein the nucleic acid sequence has at 
least 95% sequence identity to the nucleic acid sequence shown in SEQ ID NO 1. 

4. A purified protein encoded by an open reading frame of the virus of 
claim 2. 

5. A purified protein of claim 4, wherein the protein comprises an amino 
acid sequence selected from the group consisting of: 
(a) an amino acid sequence shown in odd numbered sequences of 
SEQ ID NOS. 3-165; and 
(b) amino acid sequences that differ from those specified in (a) by 
one or more conservative amino acid substitutions wherein the function 
of the protein is preserved. 

6. A purified protein with an amino acid sequence that has at least 95% 
sequence identity to the sequences specified in claim 5(a) or (b). 

7. The purified protein of claim 5, wherein the amino acid sequence is 
selected from odd numbered sequences within the group consisting of SEQ ID NOS 3-19 
and 23-165. 

8. An isolated nucleic acid molecule encoding a protein according to claim 
5. 

9. An isolated nucleic acid molecule according to claim 8, wherein the 
molecule comprises a sequence selected from the group consisting of even numbered 
sequences of SEQ ID NOS 2-164. 
10. The isolated nucleic acid molecule according to claim 9, wherein the 
molecule comprises a sequence selected from the group consisting of even numbered 
sequences of SEQ ID NOS 2-18 and 22-164. 

11. A recombinant nucleic acid molecule comprising a promoter sequence 
operably linked to a nucleic acid molecule according to claim 8. 

12. A cell transformed with a recombinant nucleic acid molecule according 
to claim 8. 

13. A non-human mammal purposefully infected with the virus of claim 2. 

14. The mammal of claim 13, wherein the mammal is a primate. 

15. An oligonucleotide comprising a sequence selected from the group 
consisting of: 
(a) at least 20 contiguous nucleotides of the nucleic acid sequence of 
the virus of claim 2; 
(b) at least 30 contiguous nucleotides of the nucleic acid sequence of 
the virus of claim 2; and 
(c) at least 50 contiguous nucleotides of the nucleic acid sequence of 
the virus of claim 2. 

16. An isolated nucleic acid molecule that: 
(a) hybridizes under stringent conditions with a nucleic acid probe 
comprising the sequence of claim 15; and 
(b) encodes a protein of claim 6. 

17. An isolated nucleic acid molecule encoding a protein of claim 6. 

18. An isolated nucleic acid molecule encoding all proteins encoded by the 
virus of claim 2, and having a biological activity of an RRV virus. 



19. A method for testing the efficacy of a drug in the treatment of a 
condition associated with the virus of claim 2, the method comprising: 
(a) administering the drug to a non-human primate infected with the virus 
of claim 2; and 
(b) observing the primate to determine if the drug prevents or reduces the 
presentation of one or more symptoms associated with viral infection. 

20. The method of claim 19, wherein the primate is immunocompromised. 

21. The method of claim 20, wherein the drug is for the treatment of 
Kaposi's sarcoma and lymphoproliferative disorders. 

22. The method of claim 20, wherein the primate is immunocompromised 
as a result of infection by Simian Immunodeficiency Virus (SIV). 

23. The method of claim 19, wherein the condition associated with infection 
with the virus of claim 2 is one or more of B-cell hyperplasia, lymphadenopathy, 
splenomegaly, hypergammaglobulinemia or autoimmune hemolytic anemia. 

24. The method of claim 19, wherein the non-human primate is a Rhesus 
macaque monkey. 




25. A method for producing a non-human primate model for testing potential 
treatments for a condition associated with an infection with the virus of claim 2, comprising 
(a) administering a treatment to the primate to render the primate 
immunocompromised; and 
(b) infecting the primate with the virus of claim 2. 

26. The method of claim 25, wherein the condition is Kaposi's sarcoma and 
lymphoproliferative disorders. 

27. The method of claim 25 wherein the treatment used to render the primate 
immunocompromised is infection with SIV. 

28. The method of claim 25 wherein the non-human primate is a Rhesus 
macaque monkey. 




29. A method for testing the efficacy of a candidate vaccine against the virus 
of claim 2, or conditions associated with infection with the virus of claim 2, the method 
comprising: 
(a) administering the vaccine to a subject capable of infection with the 
virus of claim 2; 
(b) inoculating the subject with the virus; and 
(c) observing the subject to determine if the vaccine prevents or reduces 
an incidence of viral infection or presentation of one or more conditions associated with 
the viral infection. 

30. The method of claim 29, wherein the subject is a primate. 

31. The method of claim 30, wherein the primate is a non-human primate. 

32. The method of claim 29, wherein the primate is immunocompromised. 

33. The method of claim 29, wherein the conditions associated with infection 
include B-cell hyperplasia, lymphadenopathy, splenomegaly, hypergammaglobulinemia 
or autoimmune hemolytic anemia. 

34. The method of claim 31, wherein the non-human primate is a Rhesus 
macaque monkey. 




